CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II) • The model likelihood: Forward algorithm, backward algorithm • Posterior decoding

Upload
eldon
Category

Documents
view
24
download
0

Embed Size (px):

description

CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II). The model likelihood: Forward algorithm, backward algorithm Posterior decoding. 0.70. 0.65. A: .170 C: .368 G: .274 T: .188. A: .372 C: .198 G: .112 T: .338. 0.35. 0.30. +. −. - PowerPoint PPT Presentation

Transcript of CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

CISC 667 Intro to Bioinformatics(Fall 2005)

Hidden Markov Models (II)

• The model likelihood: Forward algorithm, backward algorithm

• Posterior decoding

Page 2: CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

The probability that sequence x is emitted by a state path π is:

P(x, π) = ∏i=1 to L eπi (xi) a πi πi+1

i:123456789 x:TGCGCGTAC π :--++++---

P(x, π) = 0.338 × 0.70 × 0.112 × 0.30 × 0.368 × 0.65 × 0.274 × 0.65 × 0.368 × 0.65 × 0.274 × 0.35 × 0.338 × 0.70 × 0.372 × 0.70 × 0.198.

Then, the probability to observe sequence x in the model is P(x) = π P(x, π),

which is also called the likelihood of the model.

A: .170C: .368G: .274T: .188

A: .372C: .198G: .112T: .338

0.35

0.30

0.65 0.70

+ −

Page 3: CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

How to calculate the probability to observe sequence x in the model?

P(x) = π P(x, π)

Let fk(i) be the probability contributed by all paths from the beginning up to (and include) position i with the state at position i being k.

The the following recurrence is true:

f k(i) = [ j f j(i-1) ajk ] ek(xi)

Graphically,

x1 x2 x3 xL

Again, a silent state 0 is introduced for better presentation

Page 4: CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

Forward algorithm

Initialization: f0(0) =1, fk(0) = 0 for k > 0.

Recursion: fk(i) = ek(xi) jfj(i-1) ajk.

Termination: P(x) = kfk(L) ak0.

Time complexity: O(N2L), where N is the number of states and L is the sequence length.

Page 5: CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

Let bk(i) be the probability contributed by all paths that pass state k at position i.

bk(i) = P(xi+1, …, xL | (i) = k)

Backward algorithm

Initialization: bk(L) = ak0 for all k.

Recursion (i = L-1, …, 1): bk(i) = j akj ek(xi+1) fj(i+1).

Termination: P(x) = k a0k ek(x1)bk(1).

Time complexity: O(N2L), where N is the number of states and L is the sequence length.

Page 6: CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

Posterior decoding

P(πi = k |x) = P(x, πi = k) /P(x) = fk(i)bk(i) / P(x)

Algorithm:

for i = 1 to L

do argmax k P(πi = k |x)

Notes: 1. Posterior decoding may be useful when there are multiple almost most probable paths, or when a function is defined on the states.

2. The state path identified by posterior decoding may not be most probable overall, or may not even be a viable path.

Page 7: CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (II)

CISC667, F05, Lec11, Liao

• The log transformation for Viterbi algorithm

vk(i) = ek(xi) maxj (vj(i-1) ajk);

ajk = log ajk;

ek(xi) = log ek(xi);

vk(i) = log vk(i);

vk(i) = ek(xi) + maxj (vj(i-1) + ajk);

Allewaa Alarabi Newspaper issue 667

Protein Tertiary Structure Prediction Structural Bioinformatics.

$CISC 4090 Theory of Computationstorm.cis.fordham.edu/leeds/cisc4090S17/Lecture1_feb13b.pdf2/13/2017 4 What is the language of M4? (page 38, Ex. 1.11) L(M4)={w| w ends and begins with$

CISC 4090 Theory of Computationstorm.cis.fordham.edu/leeds/cisc4090S17/Lecture1_feb13b.pdf2/13/2017 4 What is the language of M4? (page 38, Ex. 1.11) L(M4)={w| w ends and begins with

CISC Steel Design Series · 2017-09-21 · Part 2 Bolt Groups Subjected to an Eccentric and Inclined Point Load CANADIAN INSTITUTE OF STEEL CONSTRUCTION INSTITUT CANADIEN DE LA CONSTRUCTION

Dr. D. Y. PATIL BIOTECHNOLOGY & BIOINFORMATICS …biotech.dpu.edu.in/Documents/ConferencesWorkshops.pdf · (DEEMED UNIVERSITY) ... Dr. D. Y. PATIL BIOTECHNOLOGY & BIOINFORMATICS INSTITUTE,

Research Paper MicroRNA-21-5p mediates TGF-β-regulated ... · Genes and Genomes (KEGG) pathway analysis indicated that miR21-5p specifically binds to Smad7, while bioinformatics

Biological Research in India - · PDF fileRamachandran plot or a [φ,ψ] plot developed in 1963 by G. N. Ramachandran, C. ... biotechnology, bioinformatics, and the various ‘omics’

Less is more Approaches to biologist-driven analysis and next-generation sequencing data Paul Gordon Genome Canada Bioinformatics Platform University of.

667 Le Bon Gustave Psixologia Mazwn

$CISC 4090 Theory of Computation - Computer and …storm.cis.fordham.edu/leeds/cisc4090/Lecture1_sep25.pdf9/25/2017 4 What is the language of M4? (page 38, Ex. 1.11) L(M4)={w| w ends$

CISC 4090 Theory of Computation - Computer and …storm.cis.fordham.edu/leeds/cisc4090/Lecture1_sep25.pdf9/25/2017 4 What is the language of M4? (page 38, Ex. 1.11) L(M4)={w| w ends

Steven W. Yates - TRIUMF Workshop... · 2016. 5. 13. · M.J. Carson et al. Astroparticle Physics 21 (2004) 667–687 M.J. Carson et al. Astroparticle Physics 21 (2004) 667 – 687

The LIBI Grid Platform for Bioinformaticsmargara/page14/assets/LIBIchapter.pdf · The LIBI Grid Platform for Bioinformatics ... Mirco Mazzucato§, Salvatore My*&, Giovanna Selvaggi&

Membrane Bioinformatics SoSe 2009 Helms/Böckmann

EECS 730 Introduction to Bioinformatics Sequence Alignmentjhuan/EECS730_F12/slides/9... · 2012-09-30 · EECS 730 Introduction to Bioinformatics Sequence Alignment ... We know how

Introduction to Quantum Information Processing QIC 710 / CS 667 / PH 767 / CO 681 / AM 871

lezione18 AlbertoPaoluzziMauroCeccanti ... · Informatica Biomedica: Lezione 18 MolecularVisualization -shapes. Fonteessenziale: Chapter9ofbook: J.GuandP.E.Bourne, Structural Bioinformatics,Wiley(2009)

Ab initio methods: how/why do they work · EM. Crystallography. NMR. Biochemistry. FRET. Bioinformatics. Complementary techniques. AUC. Oligomeric . mixtures. Hierarchical systems.

BMC Bioinformatics BioMed Central - Springercides, and lipid-lowering drugs. Fibrates including bezafibrate, clofibrate, fenofibrate, WY-14,643, and oth-ers are a unique class of hypolipidemic

$ΣΗΜΕΙΩΣΕΙΣ ΣΕΜΙΝΑΡΙΑΚΩΝ ΔΙΑΛΕΞΕΩΝmde-lab.aegean.gr/images/stories/docs/matlab-marinakis.pdf · bioinfo\microarray - Bioinformatics Toolbox -- Microarray$

ΣΗΜΕΙΩΣΕΙΣ ΣΕΜΙΝΑΡΙΑΚΩΝ ΔΙΑΛΕΞΕΩΝmde-lab.aegean.gr/images/stories/docs/matlab-marinakis.pdf · bioinfo\microarray - Bioinformatics Toolbox -- Microarray

19 4 - Pharos Arts Foundation Music Festivals... · Σο *μπ 0ρ 2: Κουιντέτο για Πιάνο σε Λα μείζονα, D.667 «Πέστροφα» Yuja Wang / πιάνο,