Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Gene Finding
Gene Finding
TECHNIQUES
• Genes are made up of DNA, act as instructions to make molecules called proteins.
• The Human Genome Project has estimated that humans have between 20,000 and
25,000 genes.
• 1st time Gregor Mendel gave the idea of gene from his experiment on pea plant. He didn't
use the term gene but use factor in his first law.
• Danish botanist Wilhelm Johannsen coined the word "gene" ("gen" in Danish and
German) in 1909 to describe these fundamental physical and functional units of heredity
GENE ANATOMY
WHAT IS GENE FINDING?
• Gene finding is one of the first and most important steps in understanding
the genome of a species once it has been sequenced.
GENE PREDICTION METHODS
• Simple approach based on finding similarity in genes sequences between EST and
protein.
• If there is similarity between certain genomic region like EST,DNA, protein ,this
similarity information is used to predict gene structure and function.
• BLAST( local alignment tool) is used to detect similarity to known genes ,EST, protein.
AB INITIO GENE PREDICTION
• Signal sensor : sequence motifs such as, splice site, branch point,
polypyrimidine tract, start codon and stop codon.
• DNA is translated in all six possible reading frames, three forward and
three backward.
• Any region of DNA between a start codon and stop codon could
potentially code for a polypeptide, and therefore an ORF.
RBSFINDER:
• Searches for ribosomal binding site for prediction of translational initiation
site
GENE PREDICTION IN EUKARYOTES
• Detect exons and to precisely locate the boundary between the exon and
the contiguous introns.
1. Initial exons, from the initiation codon to the first splice site.
2. Internal exons from splice site to splice site.
3. Terminal exons from splice site to stop codon.
4. Single introns corresponding to uninterrupted, intronless genes i.e.,
running from initiation codon to stop codon.
SENSITIVITY:
The frequency with which a programme detects ‘true’ splice sites.
SPECIFICITY:
Reflects the number of predicted sites which are correct.
TRANSCRIPTION SIGNALS
• Polyadenylation signal.
TRANSLATIONAL SIGNAL
• The termination codon(s) present in the terminal exon and absent from the
initial and internal exons.
GENE FINDING PROGRAMS IN
EUKARYOTES
AB INITIO BASED GENE PREDICTION
1. GENEID:
Is ab initio gene prediction tool which is used to predict gene in eukaryotes.
2. GRAIL:
A neural network based algorithm which is used to predict splice junctions,
start and stop codons, poly-A sites, promoters, and CpG island.
3. FGENESH:
programme to predict multiple genes in genomic DNA.
AB INITIO BASED GENE PREDICTION...
4. GenScan:
1. GenomeScan:
combination of GenScan prediction result with BLASTX similarity
searches.
2. TwinScan:
A gene finding server similar to GenomeScan.
CONSENSUS BASED GENE
PREDICTION
1. GeneComber:
Combination of HMM gene and GenScan result prediction.
2. DIGIT: