Announcement

**JJackson** · March 15, 2020, 04:54 PM

I have been trying to work out what I can make the nextstrain tool do and was fairly sure it was capable of more than I had found by trial and error. My search led me to a lecture which I will come back to and link below.

re rosmarina's post I don't think there has been anything close to evidence that humans were involved in this viruses evolutionary history, beyond being unwitting hosts. I suspect the short fragment was not deemed worth uploading at the time but once COVID arrived it, and bits like it, suddenly became a lot more interesting. I expect more partial sequences but few full genomes. I looked at the human sequence data using nextstrain and looked at the AA mutation frequency across the full genome and its entropy (first image below) to get a feel for which parts were conserved and which changing. In the second image I zoom in to just the short section that matches the fragment (note the little black triangles at the bottom) and there are a few AA changes at random, with very low entropies, indicating they have little impact on the phylogenic tree's structure.

The sequence covers the region around 15,340 to 15,709 which is part of the RdRp gene which in turn is part of ORF1b. This accounts 1/75th of the genome and so would be expected to show high homology in a conserved region, and when the full sequence is blasted against the NCBI data set I get 89% homology in a range of bat and SARS-1 sequences. This is lower than I expected. It came up in today's TWiV (link below) that when the civit intermediate host for SARS-1 was found those sequences had 99% homology with the human strain (presumably across the full genome, based on context, all though not stated explicitly).

In the top image I highlighted one spike in green, this is C14408T (therefore outside KP876546) resulting in ORF1b P314L and creating clade A2a which is active in Northern Europe, hence its high entropy score.

The promised lecture link by Richard Neher, University of Basel on 6 March 2019 https://www.youtube.com/watch?v=YxTUF10redQ
He was a co-developer of Nextstrain and uses it as research tool. He starts with an intro on flu and then starts using it to analyse H3N2 data. Unfortunately in later parts of the video he is not at the podium and the sound is variable also he is pointing out features on graphics for the audience which we can not see which makes it trickier to follow. However if you persevere he looks at the predictive ability of the tree structures and how well their predictions for H3N2 over the years have compared with what actually occurred. The system is obviously making a better than average estimate of which branches would show mutations and become dominant.

This is a plug for the current TWiV which the panel, and I, both thought was Awesome! I am not even going to try list the areas covered in detail as the list of items not covered would be shorter. It is 2hrs long but I doubt you could improve you understanding of this epidemic with 2hrs spent in any other way. TWiV 591 http://www.microbe.tv/twiv/

**JJackson** · March 23, 2020, 08:46 AM

Has anyone seen anything on ORF1a L3606F? I was looking at the NTs which have been regularly cropping up and this one is interesting because it has occurred several times in different parts of the tree and the subsequent cases are holding their own against the wild type. It is the only change I have seen so far that appears to show signs of host adaption. As I do not even know which protein this changes I have no idea what functional change it may make. This is what I have been looking for but a search on ORF1a L3606F only found a few references none of which were particularly helpful.

**JJackson** · March 28, 2020, 04:59 AM

This paper is excellent and provides a ton of data I had not seen elsewhere on the coding regions and their proteins, functions, biological conformations. Not an easy read but worth the effort.

https://www.futuremedicine.com/doi/10.2217/fvl-2018-0008

**Shiloh** · April 4, 2020, 06:18 AM

Source: https://phys.org/news/2020-04-sars-c...ia-openly.html

First SARS-CoV-2 genomes in Austria openly available
by CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences

...Initial sequence analysis of the 29,900 nucleotide-long SARS-CoV-2 genomes from Austria revealed on average 6 mutations different to the reference genome isolated in Wuhan, the capital city of the province of Hubei, China in December 2019. The observed number of mutations is in line with other recently reported SARS-CoV-2 genomes. Most of the observed mutations lead to changes in viral proteins, providing evidence for positive selection pressure and evolution within the human population. Assessing the actual impact of these mutations for the virus life cycle and its interactions with both the host and the immune system will be within the scope of future investigations. Ongoing in-depth genomic analyses focus on mutational hotspots, dissect viral diversity between the Austrian strains and the strains from other countries as well as study of the mutational dynamics of pandemic SARS-CoV-2...

**JJackson** · April 6, 2020, 11:01 AM

The link is a useful lecture on SARS-2 and looks into how the virus persuades the host translational system to read its ORFs.

https://www.youtube.com/watch?v=8_bOhZd6ieM

**JJackson** · June 27, 2020, 06:34 AM

This is an interesting paper in terms of the technique and the interactive heat-map tool for looking at the data. The experiment produced engineered yeast cells each of which has a different SARS-CoV-2 spike RBD displayed on its surface. They replace every nucleotide at every position in the RBD and then measure, for each AA change, the effect on protein expression and binding affinity. They also show if that AA is in direct contact with the ACE2 receptor and what the consensus AA was at that position for each of SARS-1, RaTG13 and Pangolins. All of this data can be accessed by hovering the mouse over the squares on the heat map. This is useful data for vaccine formulation, especially mono or poly-clonal antibodies, as it shows functional biological constrained AA positions which will be more resistant to vaccine escape mutations.
https://www.biorxiv.org/content/10.1....157982v1.full
https://jbloomlab.github.io/SARS-CoV-2-RBD_DMS/ - Interactive heat-map tool.
https://www.microbe.tv/twievo/twievo-57/ - TWiEVO podcast discussion with the authors.

The heat map works well as an adjunct to the Nextstrain tool, discussed earlier, to examine how well the experimental data matches the actual mutational frequencies. It may also be helpful to have a look at my first post in this thread which looks at additional SL BetaCoV RBDs and the S1 protein structure around the RBD pocket.
https://flutrackers.com/forum/forum/...rsonal-opinion

**Pathfinder** · June 2, 2021, 01:46 PM

Originally posted by Pathfinder View Post

Mining coronavirus genomes for clues to the outbreak’s origins

By Jon CohenJan. 31, 2020 , 6:20 PM
...
“One of the biggest takeaway messages [from the viral sequences] is that there was a single introduction into humans and then human-to-human spread,” says Trevor Bedford, a bioinformatics specialist at the University of Washington and Fred Hutchinson Cancer Research Center.
...
The longer a virus circulates in a human populations, the more time it has to develop mutations that differentiate strains in infected people, and given that the 2019-nCoV sequences analyzed to date differ from each other by seven nucleotides at most, this suggests it jumped into humans very recently. But it remains a mystery which animal spread the virus to humans.
...
According to Xinhua, the state-run news agency, “environmental sampling” of the Wuhan seafood market has found evidence of 2019-nCoV. Of the 585 samples tested, 33 were positive for 2019-nCoV and all were in the huge market’s western portion, which is where wildlife were sold. “The positive tests from the wet market are hugely important,” says Edward Holmes, an evolutionary biologist at the University of Sydney ...
...
Yet there have been no preprints or official scientific reports on the sampling, so it’s not clear which, if any, animals tested positive. “Until you consistently isolate the virus out of a single species, it’s really, really difficult to try and determine what the natural host is,” says Kristian Andersen, an evolutionary biologist at Scripps Research.
...
It’s not just a “curious interest” to figure out what sparked the current outbreak, Daszak says. “If we don't find the origin, it could still be a raging infection at a farm somewhere, and once this outbreak dies, there could be a continued spillover that’s really hard to stop. But the jury is still out on what the real origins of this are.”

https://www.sciencemag.org/news/2020...reak-s-origins

Jimmy Tobias

@JamesCTobias
Very interesting email from the Fauci documents obtained by
@JasonLeopold
:

1:33 PM · Jun 1, 2021·Twitter Web App

Announcement

Discussion - 2019-nCoV genetics

Comment

Comment

Comment

Comment

Comment

Comment

Comment