Talk:Multiple sequence alignment
![]() | Multiple sequence alignment has been listed as one of the Natural sciences good articles under the good article criteria. If you can improve it further, please do so. If it no longer meets these criteria, you can reassess it. | ||||||||||||
| |||||||||||||
Current status: Good article |
![]() | This article is rated GA-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||||||||||
|
some clarifications
the statement
Because HMMs are probabilistic, they do not produce the same solution every time they are run on the same dataset; thus they cannot be guaranteed to converge to an optimal alignment. HMMs can produce both global and local alignments. Although HMM-based methods have been developed relatively recently, they offer significant improvements in computational speed, especially for sequences that contain overlapping regions.
is incorrect. HMMs are probablistic in the sense that they are a statistical model, however, they are completely deterministic and will produce the same result every time on a given dataset. HMM alignments use the same algorithms as local sequence alignments and therefore have no computational speed advantage.
MEME uses a PSSM (position specific scoring matrix), but does not contain insertion or deletion probabilities or other characteristics of a typical sequence HMM.
Gribskov 03:55, 20 September 2007 (UTC)
Given the specific technological and algorithmic and biological significance of short-read sequence alignment, I think this topic deserves its own page. For example, the differences between short read mapping and de-novo assembly in next-generation sequencing projects could be discussed on such a page. --Dan|(talk) 14:01, 22 January 2009 (UTC)
Alternative interpretations of MSAs
The main use/interpretation of columns in MSAs is that residues in the same column are "related" by either point substitutions or no substitutions at all.
However, there are applications of MSAs where residues in the same column are assumed to be "structurally" equivalent but not necessarily evolutionarily equivalent e.g. http://www.ncbi.nlm.nih.gov/pubmed/16733545 - indeed in some of these applications the aim is to avoid including "homologous" sequences in the alignment e.g. http://www.ncbi.nlm.nih.gov/pubmed/9920390
At the moment this distinction isn't made on the MSA wikipedia page - although the top of the sequence alignment wikipedia page does highlight different interpretations.
First wikipedia post ever here - not quite ready to be bold yet! - so wanted to ask/check whether anyone disagrees with introducing some changes to reflect this distinction to the MSA page? SiggyDood (talk) 12:45, 11 March 2009 (UTC)
GA Reassessment
- This discussion is transcluded from Talk:Multiple sequence alignment/GA1. The edit link for this section can be used to add comments to the reassessment.
Starting GA reassessment as part of the GA Sweeps process. Jezhotwells (talk) 19:18, 28 February 2010 (UTC)
Checking against GA criteria
- It is reasonably well written.
- a (prose):
b (MoS):
This article appears more like an essay or a paper than an encyclopaedia artcile. Consider a thorough copy-edit for style and clarity.Done
- a (prose):
- It is factually accurate and verifiable.
- a (references):
b (citations to reliable sources):
c (OR):
I repaired dead links using WP:CHECKLINKS. All references appear to be OKDone
There are many uncited paragraphs.Done
- a (references):
- It is broad in its coverage.
- a (major aspects):
b (focused):
- a (major aspects):
- It follows the neutral point of view policy.
- Fair representation without bias:
- Fair representation without bias:
- It is stable.
- No edit wars, etc.:
- No edit wars, etc.:
- It is illustrated by images, where possible and appropriate.
- a (images are tagged and non-free images have fair use rationales):
b (appropriate use with suitable captions):
Images such as File:Caspase-motif-alignment.png and File:RPLP0 90 ClustalW aln.gif are illegible in the article and appear to add little.Done
- a (images are tagged and non-free images have fair use rationales):
- Overall:
- Pass/Fail:
Main concerns: the style of the article is un-encyclopaedic, images add little, many uncited paragraphs. On hold until 7 March, major contributors and projects will be notified. Jezhotwells (talk) 19:29, 28 February 2010 (UTC)- OK, thanks for for fixing things up, keep GA status. Jezhotwells (talk) 18:49, 7 March 2010 (UTC)
- Pass/Fail:
Note
Most of the unencylopedic style was inserted by a well-meaning but seemingly novice editor. I've removed much of that content because it was redundant with other, cited parts of the article. I've also significantly enlarged the lead image. Without such detail, the image is useless to the reader unless he or she clicks through to the larger media file -- which is unlikely. All paragraphs now have at least one relevant reference. If are any issues that remain to be addressed for the purposes of this reassessment, please let me know. Thanks, Emw (talk) 08:10, 6 March 2010 (UTC)
- Hi, making the image larger doesn't really address the problem which is that the image does not convey any information. Please see WP:MOS#Avoid entering textual information as images and WP:MOS#Images which suggest no larger than 300px for lead images. Jezhotwells (talk) 14:51, 6 March 2010 (UTC)
- Images portraying multiple sequence alignments seem like a valid exception to the guideline discouraging the use of textual information in images. Presumably that guideline pertains to uses of natural language in images. In contrast, the text used in the two images in this article represent sequences of amino acids. It is the convention used to represent MSAs among reliable sources (i.e., textbooks, journal articles, reliable websites). Also, I think the lead image's detail necessitates its larger-than-usual dimensions. Such exceptions are provided for in the MOS: "Images containing important detail (for example, a map, diagram, or chart), and which may need larger sizes than usual." Emw (talk) 15:23, 6 March 2010 (UTC)
- OK, I'll buy that, Another possibility, which I ask tou to consider is moving the image elsewhere in teh artcile so that it doesn't sandwich the lead. Jezhotwells (talk) 18:49, 7 March 2010 (UTC)
- Images portraying multiple sequence alignments seem like a valid exception to the guideline discouraging the use of textual information in images. Presumably that guideline pertains to uses of natural language in images. In contrast, the text used in the two images in this article represent sequences of amino acids. It is the convention used to represent MSAs among reliable sources (i.e., textbooks, journal articles, reliable websites). Also, I think the lead image's detail necessitates its larger-than-usual dimensions. Such exceptions are provided for in the MOS: "Images containing important detail (for example, a map, diagram, or chart), and which may need larger sizes than usual." Emw (talk) 15:23, 6 March 2010 (UTC)
External links modified (February 2018)
Hello fellow Wikipedians,
I have just modified 4 external links on Multiple sequence alignment. Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:
- Added archive https://web.archive.org/web/20100311140200/http://www.ebi.ac.uk/help/matrix.html to http://www.ebi.ac.uk/help/matrix.html
- Added archive https://web.archive.org/web/20060705082556/http://align.genome.jp/ to http://align.genome.jp/
- Added archive https://web.archive.org/web/20060909003117/http://pbil.univ-lyon1.fr/alignment.html to http://pbil.univ-lyon1.fr/alignment.html
- Added archive https://web.archive.org/web/20090821192233/http://bioinf.may.ie/school02/notes.html to http://bioinf.may.ie/school02/notes.html
When you have finished reviewing my changes, you may follow the instructions on the template below to fix any issues with the URLs.
This message was posted before February 2018. After February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{source check}}
(last update: 5 June 2024).
- If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
- If you found an error with any archives or the URLs themselves, you can fix them with this tool.
Cheers.—InternetArchiveBot (Report bug) 04:37, 8 February 2018 (UTC)
- Wikipedia good articles
- Natural sciences good articles
- GA-Class Molecular Biology articles
- High-importance Molecular Biology articles
- GA-Class Genetics articles
- Mid-importance Genetics articles
- WikiProject Genetics articles
- GA-Class MCB articles
- Mid-importance MCB articles
- WikiProject Molecular and Cellular Biology articles
- GA-Class Computational Biology articles
- High-importance Computational Biology articles
- WikiProject Computational Biology articles
- All WikiProject Molecular Biology pages