Python Programming for Biology: Bioinformatics and Beyond


Whether to align protein or DNA



[Tim J. Stevens, Wayne Boucher] Python Programming


When a sequence alignment problem involves the coding regions of genes and the proteins
they encode, there is often a choice between aligning nucleotide sequences and aligning
protein sequences. In general, the choice is governed by the specific purpose of the
investigation. For example, if you are studying the spread of an influenza virus through
human populations, then aligning nucleotide sequences would be the best option, because
the rate of nucleotide change in the virus genome is high and you wish to study every
genetic change in fine detail.

Conversely, if you are interested in protein structure, perhaps for comparative modelling,
then using nucleotide sequences would introduce unnecessary error and you would always use
the protein sequence, albeit possibly one translated from the DNA. In such circumstances,
because each nucleotide position holds one of only four bases, the chance of a random match
is quite high compared to an amino acid position, which holds one of 20 residues; i.e. the
chances of spurious matches in protein sequences are much smaller. It is also worth noting
that some DNA changes have no effect on the protein at all, and thus are irrelevant for
many questions concerning the protein sequence. Lastly, when dealing with amino acids, we
almost always score an alignment by the degree to which the aligned residues are similar,
rather than just counting how many are identical, as is often done for DNA.
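
The difference in random match rates can be checked with a small sketch. It assumes, purely for illustration, that every letter of each alphabet is equally likely, which real sequences do not obey exactly; the function names and the sequence length are hypothetical choices, not taken from the text.

```python
import random

DNA_ALPHABET = "ACGT"                      # four bases
PROTEIN_ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids


def expected_random_identity(alphabet):
    """Chance that two letters drawn uniformly and independently are identical."""
    return 1.0 / len(alphabet)


def simulated_random_identity(alphabet, length=10000, seed=1):
    """Fraction of identical positions between two random, unrelated sequences."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    seq_a = [rng.choice(alphabet) for _ in range(length)]
    seq_b = [rng.choice(alphabet) for _ in range(length)]
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return matches / length


for name, alphabet in (("DNA", DNA_ALPHABET), ("protein", PROTEIN_ALPHABET)):
    print(name,
          expected_random_identity(alphabet),
          simulated_random_identity(alphabet))
```

Under the uniform assumption the expected chance of a spurious match is one in four for DNA and one in 20 for protein, and the simulated fractions should fall close to those values.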

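
To illustrate DNA changes that leave the protein untouched, the following minimal sketch translates two short coding sequences with the standard genetic code. The two example sequences are invented for the purpose, and the helper names translate and count_differences are hypothetical; the sketch assumes both sequences are in frame and of equal length.

```python
from itertools import product

# Standard genetic code in compact form: codons are enumerated with the bases
# in the order T, C, A, G (last position varying fastest), and AMINO_ACIDS
# lists the residue for each codon in that order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): residue
               for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate an in-frame DNA sequence, ignoring any trailing partial codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def count_differences(seq_a, seq_b):
    """Count positions at which two equal-length sequences differ."""
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


# Invented example: the two sequences differ at four nucleotide positions
dna_a = "ATGGCTCTGAAA"
dna_b = "ATGGCCTTAAAG"

print(count_differences(dna_a, dna_b))                        # 4
print(translate(dna_a), translate(dna_b))                     # MALK MALK
print(count_differences(translate(dna_a), translate(dna_b)))  # 0
```

All four nucleotide differences here are silent, so a nucleotide alignment would report changes that a protein alignment would not see at all.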

In general, scoring by similarity is possible because the chemical structure of amino
acids allows you to say how similar they are, for example in terms of size and charge or
in the ability to form a loop. Such scoring can reveal relationships, e.g. conservation
caused by the protein's structure, which would not be visible from counting identical
matches alone.
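
The contrast between counting identities and scoring similarity can be sketched as follows. The property groups and the score values (2, 1 and -1) are a deliberately crude, hypothetical scheme chosen for illustration; they are not a published substitution matrix, and the sketch assumes the two sequences are already aligned, of equal length and free of gaps.

```python
# Rough, illustrative grouping of amino acids by chemical character
# (an assumption for this sketch, not a standard classification)
PROPERTY_GROUPS = ["AVLIM", "FWY", "KRH", "DE", "STNQ", "C", "G", "P"]
GROUP_OF = {residue: index
            for index, group in enumerate(PROPERTY_GROUPS)
            for residue in group}


def identity_count(seq_a, seq_b):
    """Number of aligned positions holding exactly the same residue."""
    return sum(1 for a, b in zip(seq_a, seq_b) if a == b)


def similarity_score(seq_a, seq_b, identical=2, similar=1, different=-1):
    """Score aligned residues by whether they are identical, alike or unlike."""
    score = 0
    for a, b in zip(seq_a, seq_b):
        if a == b:
            score += identical
        elif GROUP_OF[a] == GROUP_OF[b]:
            score += similar
        else:
            score += different
    return score


query = "KLDEV"
for other in ("RIEDL", "SGWPC"):
    print(other, identity_count(query, other), similarity_score(query, other))
```

Neither comparison has a single identical residue, yet the first pairs each residue with a chemically similar one and scores 5, while the second scores -5. Counting identities alone would treat the two cases as the same.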

