参考资料
参考资料

参考资料

生物序列

🍎A general method applicable to the search for similarities in the amino acid sequence of two proteins
🍎 Identification of Common Molecular Subsequences
🍏.An improved algorithm for matching biological sequences
🍏Optimal alignment in linear space

概率论

🍎 Biological Sequence Analysis Chapter1 and Chapter 11
🍏Probability Theory: The logic of Science
🍏数学之美

复杂度分析

🍎Introduction to Algorithms Chapter1 and Chapter 2

DP和Greedy

🍎Introduction to Algorithms Chapter15 and Chapter 16

BLAST

🍎Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes
🍏Basic Local Alignment Search Tool
🍏Amino Acid substitution matrices from an information theoretic perspective
🍏A protein alignment scoring system sensitive at all evolution distances
🍏Local alignment statistics

HMM

🍎A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
🍏Fundamentals of Speech Recognition

基因预测

🍏Prediction of complete gene structures in human genomic DNA
🍏Using ESTs to Improve the Accuracy of de novo Gene Prediction
🍏The Completion of the Mammalian Gene Collection (MGC)
🍏Closing in on the C.elegans ORFeome by Cloning TWINSCAN predictions
🍏Revealing missing human protein isoforms based on ab initio prediction, RNA-seq and proteomics

多序列比对

🍎Biological Sequence Analysis Chapter5 and Chapter 6

Alignment-free sequence analysis

🍏CVTree: a phylogenetic tree reconstruction tool based on whole genomes
🍏Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
🍏Phymm and PhymmBL: metagenomic phylogenic classification with interpolated Markov models
🍏PhymmBL expanded: confidence scores, custom databases, parallelization and more

Motif finding

🍎Assessing computational tools for the discovery of transcription factor binding sites