Announcement

**gsgs** · November 21, 2009, 05:54 AM

Re: Sequence Analysis Using MUSCLE

I just tried muscle, but it ran out of memory and seems a little slow.
I had been using kalign.exe and MAFFT online.

For simple flu-alignments (no insertions,deletions, no different
HAs or NAs or NSs) I have a self-written simple program
which is very fast and needs little memory.
So, e.g. I can align 7000 PB2s in 2min.

**gsgs** · November 21, 2009, 05:58 AM

Re: Sequence Analysis Using MUSCLE

the nucleotide-sequences start with ~50 nucleotides
which are not decoded to amino-acids.
Often only parts of these 50 are given or none
The first occurrance of "ATG" is usually the first decoded amino-acid
(Methionine,Met,M)

also niman-H274Y is H275Y in N1
and D225G is D239G in H1

**mixin** · November 21, 2009, 07:13 AM

Re: Sequence Analysis Using MUSCLE

OK! The example I'm showing has the first "atg" for all three starting at position 9.

"We see the mutation at position 831 instead of 822;" so if I subtract 8, I'm at position 823... so I'm still 1 off? Niman's 275 makes it worse.

When I've seen the nucleotides and amino acids aligned for comparison purposes, the M is under the T.. so when I count, is the M position considered #1 or #2?

BTW, thank you for all your hours and patience.

**gsgs** · November 21, 2009, 07:18 AM

Re: Sequence Analysis Using MUSCLE

C823T(6,n)=H275Y(NA) CAC-->TAC

this is for starting to count at the coding region (=first amino acid, ATG), which I think is unusual

for ******:
S224P in PA is T670C(3)
M582L in PA is A1741C(3)
S91P in HA is T298C(4)
S206T in HA is T658A(4)
V323I in HA is G1012A(4)
V100I in NP is G298A(5)
T373I in NP is C1118T(5)
V106I in NA is G316A(6)
N247D in NA is A742G(6)

all the 3 can mutate, see list below

mutations at position 3 in a codon (3 consecutive nucleotides)
are usually synonymous
(don't change the encoded amino acid)

Alanine,Ala,A,4,GCT,GCC,GCA,GCG
Arginine,Arg,R,6,CGT,CGC,CGA,CGG,AGA,AGG
Asparagine,Asn,N,2,AAT,AAC
AsparticAcid,Asp,D,2,GAT,GAC
Cysteine,Cys,C,2,TGT,TGC
GlutamicAcid,Glu,E,2,GAA,GAG
Glutamine,Gln,Q,2,CAA,CAG
Glycine,Gly,G,4,GGT,GGC,GGA,GGG
Histidine,His,H,2,CAT,CAC
Isoleucine,Ile,I,3,ATT,ATC,ATA
Leucine,Leu,L,6,TTA,TTG,CTT,CTC,CTA,CTG
Lysine,Lys,K,2,AAA,AAG
Methionine,Met,M,1,ATG
Phenylalanine,Phe,F,2,TTT,TTC
Proline,Pro,P,4,CCT,CCC,CCA,CCG
Serine,Ser,S,6,TCT,TCC,TCA,TCG,AGT,AGC
Threonine,Thr,T,4,ACT,ACC,ACA,ACG
Tryptophan,Trp,W,1,TGG
Tyrosine,Tyr,Y,2,TAT,TAC
Valine,Val,V,4,GTT,GTC,GTA,GTG
STOP,Sto,},3,TAG,TGA,TAA

hydrophobic:GAVLIMFWP
hydrophilic:STCYNQ,DE,KRH

**sharon sanders** · November 21, 2009, 09:27 AM

Re: Sequence Analysis Using MUSCLE

Thank you very much, we all want to learn.

225G Preliminary Worldwide Tracking & Evaluation

225G Worldwide Tracking & Evaluation - FluTrackers News and Information

http://www.flutrackers.com/forum/showthread.php?t=134015

Norway - H1N1 "Mutation" Announced by Health Department

Norway - H1N1 "Mutation" Announced by Health Department - FluTrackers News and Information

http://www.flutrackers.com/forum/showthread.php?t=133897

Wales: Tamiflu-resistant swine flu spreads 'between patients'

FluTrackers News and Information

http://www.flutrackers.com/forum/showthread.php?t=133946

Tamiflu-resistant cluster in N. Carolina

FluTrackers News and Information

http://www.flutrackers.com/forum/showthread.php?t=133979

FluTrackers Swine Flu Genetic Forum

http://www.flutrackers.com/forum/for...lay.php?f=1527

**Sally Furniss** · November 27, 2009, 05:23 PM

Re: Sequence Analysis Using MUSCLE

Thanks, great instructions!

**Sally Furniss** · November 29, 2009, 10:12 PM

Re: Sequence Analysis Using MUSCLE

Converting bare sequences to FASTA format

1. Get bare sequence
2. paste in to the Readseq - biosequence conversion tool http://www.ebi.ac.uk/cgi-bin/readseq.cgi
3.Select PeasonFasta,
4 Select view in browser (or download to file ) click submit
5. Copy paste contents into MUSCLE http://www.ebi.ac.uk/Tools/muscle/

**Sally Furniss** · November 30, 2009, 12:28 AM

Re: Mutations in A/H1N1 Not Confirmed to Affect Effectiveness of Current Vaccine

We both used the same data, why is the numbering different?

**Sally Furniss** · November 30, 2009, 01:39 AM

Re: Mutations in A/H1N1 Not Confirmed to Affect Effectiveness of Current Vaccine

Originally posted by Sally View Post

Comparing both sequences against the vaccine. A/California/07/2009(H1N1) accession number FJ969540

Why would there be an R (2 of them) on the A/California/07/2009(H1N1) at about positions 715 and 718 ?

**Sally Furniss** · November 30, 2009, 03:51 AM

Stop codons: the Good, the Bad, and the Ugly

Comes with video
http://www.mcb.arizona.edu/courses/m.../XLateTut.html

Stop codons: the Good, the Bad, and the Ugly

Stop codons are a normal part of protein synthesis--they're the reason that all proteins don't go on 'forever'. Given a translation machinery that simply puts one foot in front of the other endlessly, a mechanism must exist for derailing the machine when its work is done. This machinery is the three Stop (or 'nonsense') codons and the proteins that read them. They're encoded by every gene, and are already there when the mRNA is produced--the whole process of translation is the interpretation of a ticker tape by an elegant machine (the ribosome) charged with 'translating' a nucleotide language into an amino acid language.

It is not known, at least by me, why there are 3 stop codons and why they are UAA, UAG and UGA (indeed, in some systems, such as some mitochondria, UGA actually specifies Trp instead of stop). But given that there are 64 possible codons and 3 mean 'stop', ON AVERAGE, with all other things being equal (which they never are...) 1 of 20 randomly selected codons says STOP. Similarly, if you're reading in an unanticipated/incorrect reading frame, you're in essence reading random codons, so will ON AVERAGE get about 20 amino acids before being stopped out. That's not very far!

The existence of stop codons needs to permeate your thinking about what is and is not 'fixable'. Sure, a -1 frameshift has the ability to compensate for a +1 frameshift--IF there is no intervening stop codon! Recall the translation tutorial (or review it if you can't recall it...). In the second movie shown, reading in the +1 frame (the result of a single nucleotide insertion) 'uncovered' a stop codon that derailed translation. In the third movie, our hero, in the form of a -1 frameshift (== nucleotide removal) fixed things 'just in time' such that reading frame was restored before the evil stop codon brought the party crashing down. Any mutations FURTHER DOWN (rightward, = the 3' direction) would have availed us naught.

Some simple questions to direct you thinking in fruitful ways about the influence of stop codons for good and ill:
--How can you pick a region such that you can be reasonably confident that a stop codon occurs in a given reading frame?
--if you don't wish to worry your pretty little head about the nasty possibility of stop codons, what locations will you choose to examine for your compensating mutations vis-a-vis the location of the mutation they're meant to fix?
--in general, what rules determine where a compensating mutation can occur relative to the mutation being 'fixed' or compensated for (this can be a little tricky, given most of our innate biases about who is the 'problem' and who the 'solution'--recall any frameshift is a drag unless corrected in a timely fashion, and that any solution is a good solution so long as we're still reading and reading in frame when we hit the 'business end' of the rIIb gene!

Page not found | Molecular and Cellular Biology

http://www.mcb.arizona.edu/courses/mcb422/SupplementsFolder/Stops.html

**JJackson** · November 30, 2009, 05:35 AM

Re: Mutations in A/H1N1 Not Confirmed to Affect Effectiveness of Current Vaccine

Originally posted by Sally View Post

We both used the same data, why is the numbering different?

The wonders of the numbering systems is something I have yet to master. I did use some other sequences in my alignment and a program called CLC sequence viewer for my nucleotide alignment. I exported a Custal .aln alignment file which I then loaded into Bioedit (because I am more familiar with it).
However if I adjust my aligned sequences so D225G really is at position 225 then the two non-change-changes are N2N (ANADTL) & D475D (HKCDNTC) or in nucleotide terms 6 & 1425.

EDIT:
oops this must be confusing the hell out of everyone as this is in the wrong thread and relates to Sally and my numbering differences on the Lviv sequences

**Sally Furniss** · November 30, 2009, 05:46 AM

Re: Mutations in A/H1N1 Not Confirmed to Affect Effectiveness of Current Vaccine

Originally posted by JJackson View Post

The wonders of the numbering systems is something I have yet to master. I did use some other sequences in my alignment and a program called CLC sequence viewer for my nucleotide alignment. I exported a Custal .aln alignment file which I then loaded into Bioedit (because I am more familiar with it).
However if I adjust my aligned sequences so D225G really is at position 225 then the two non-change-changes are N2N (ANADTL) & D475D (HKCDNTC) or in nucleotide terms 6 & 1425.

How did you get these easily. N2N (ANADTL) & D475D (HKCDNTC) . Do you have conversion program?

**JJackson** · November 30, 2009, 05:52 AM

Re: Sequence Analysis Using MUSCLE

I use a program call Bioedit. You just hold the Ctrl key down and press G to toggle backwards and forwards between the Protein & Nucleotide sequences.

**sharon sanders** · November 30, 2009, 06:02 AM

Re: Mutations in A/H1N1 Not Confirmed to Affect Effectiveness of Current Vaccine

Originally posted by JJackson View Post

The wonders of the numbering systems is something I have yet to master. I did use some other sequences in my alignment and a program called CLC sequence viewer for my nucleotide alignment. I exported a Custal .aln alignment file which I then loaded into Bioedit (because I am more familiar with it).
However if I adjust my aligned sequences so D225G really is at position 225 then the two non-change-changes are N2N (ANADTL) & D475D (HKCDNTC) or in nucleotide terms 6 & 1425.

EDIT:
oops this must be confusing the hell out of everyone as this is in the wrong thread and relates to Sally and my numbering differences on the Lviv sequences

This is good on this thread because this is the learning to read sequences thread.

Announcement

Sequence Analysis Using MUSCLE

Sequence Analysis Using MUSCLE

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment