Verna Vu & Timothy Abreo. δ C-doublecortin-like kinase (Dclk1) Transgenic mice express a...

Post on 02-Jan-2016

215 views 0 download

Transcript of Verna Vu & Timothy Abreo. δ C-doublecortin-like kinase (Dclk1) Transgenic mice express a...

Verna Vu & Timothy Abreo

δC-doublecortin-like kinase (Dclk1)

Transgenic mice express a constitutively active splice variant of doublecortin-like kinase 1 (Dclk1) gene.

Goals:Study the transcriptome of wild type vs transgenic mice

Compare microarray with DGE

Study the transcriptome of transgenic mice vs wild type

http://pubs.niaaa.nih.gov/http://www.lintoninst.co.uk

Microarray

Problems: • high back ground• measures only relative abundances of transcripts• only predefined sequences can be detected• transcripts expressed at low levels can’t be detected

5 different microarray• Affymetrix• Agilent • Illumina • Applied Biosystems• Home spotted

dnachips-microarrays.wikispaces.com/

Quantitative PCR analysis

• Cycle Based• Sybr Green: Binds to all ds PCR products• Gold standard: used to validate

Problems• Can’t Multiplex• Many cycles to detect low frequency RNA transcripts, can be lost in background noise• Can’t use on all transcripts

http://www.thermoscientificbio.com

Helpful Nomenclature

• Canonical sequence: sequences that reflects the most common choice of base at each position

• Noncanonical sequence: tags in the genome that map to any known exon either strand

• Transcripts per million: t.p.m.

Origins of Digital Gene Expression (DGE): SAGE

Problems:• Laborious• Sequencing steps• 100 k canonical tags would take up to a year • Considerable financial investment needed

www.sagenet.org/

Digital Gene Expression (DGE) tag profiling

Procedure

ENSEMBL transcripts

To enable comparison, all canonical sequence tags and microarray

probe sequences were put in FASTA format and then aligned to

the ENSEMBL transcript database.

Same samples used for DGE and microarray.

http://en.wikipedia.org

• Transcripts with different 3’ ends that are separated by at least one restriction site can be differentiated with DGE

• 47% of detected ENSEMBL transcripts were discovered by more than one tag, most likely a result

of alternative polyadenylation in 3’UTR

• 29% estimated previously based on EST sequences

Alternative Polyadenylation

http://www.nature.com

Antisense Transcription

• Found evidence of bidirectional transcription in 51% of gene clusters when considering canonical and non-canonical tags of abundance >2 t.p.m.

• Antisense transcripts were expressed at high levels, although sense transcripts were still most abundant

http://www.nature.com

Differentially Expressed Genes

● Sequencing of pooled samples caused problems; blood contamination in one sample

● Used Bayesian statistics to attempt to exclude genes that were not truly differentially expressed

Biological Implications

● Used Gene Ontology consortium and DGE data to identify differentially regulated gene sets

● Most affected pathway:○ Disturbances of microtubule guided transport of SNARE

containing synaptic vesicles due to changes in gene expression

● CaMK pathway was the second most affected pathway, possibly via feedback mechanisms○ involved in behavioral pathways in brain

http://eferrari.blogs.lincoln.ac.uk

Dynamic Range

● The ratio between the largest and smallest possible values of a changeable quantity

● 3 to 4 orders of magnitude

● Lowest frequency but most consistently detected was 2 t.p.m. (0.3 copies per cell)

The effect of sequencing depth on detection of differentially expressed genes

● Simulated SAGE: randomly took 1/60 of DGE reads.○ Detected a decrease in 15 fold from 3179 to 200 reads

● Simulated SAGE: ○ lowest: 91 t.p.m.

● DGE:○ average: >2 t.p.m. ○ median: 4 t.p.m○ lowest: 0.8 t.pm.

● Microarray platforms○ median: 106 t.p.m ○ Affymetrix had the most transcripts

in common with DGE■ 11 different probes per transcript

Comparison to SAGE and Microarray

Validation with qPCR

● 29 significant genes from DGE, 33 from microarray ● 43/62 showed some sort of variation between the wt and transgenic same

direction● only 5 were considered to be significant by both platforms● To test for accuracy

The correlation between the normalized number of counts from the summed individual samples in their laboratory and the pool analyzed in the other laboratory were

● 0.98 wild-type

● 0.96 transgenic

To test for precision

Technique allows collaboration with other labs.

False discovery rate was 8.5%

Reproducibility

1. unbiased view of the transcriptome2. detects high levels of differential polyadenylation and antisense

transcription 3. data are more precise & accurate than microarray data4. data analysis requires a lower number of preprocessing facilitates

interlaboratory comparisons5. high interlaboratory comparability of DGE data, probably due to the

avoidance of hybridization processes (notoriously difficult to standardize) 6. more sensitive in the detection of low-abundant transcripts and of small

changes in gene expression. ▪ Absence of background signal and saturation effects

Why use DGE over Microarrays?

Implications:

● Enhancements in sequencing depth to improve accuracy in particular for low abundant transcripts

● Improvements in sensitivity, resolution, interlaboratory consistency○ Boost field of expression profiling

● Basic research and comparative genomics fields will benefit from major improvement of data portability○ Once held back by extensive and lengthy standardization issues

Deep sequencing→ robustness, comparability and richness of expression profiling data.

Will boost collaborative, comparative and integrative genomics studies

RNAseq

What the Future has in Store

static1.squarespace.com/

Questions? Criticism?

http://www.clipartbest.com