Annotation Statistics

Annotation revision: Mt3.5v5
Annotation date: 10/2011

Gene

Confidence class

Reference

Gene Loci

Gene Models

F
FL-cDNA

E
EST matches

H
Homology

I
Intrinsic

L
Low probability

1

4371

4581

784

850

1598

595

754

2

4917

5123

902

868

1981

558

814

3

5862

6115

1060

1131

2128

789

1007

4

6200

6478

1143

1245

2267

762

1061

5

7248

7527

1241

1321

2699

998

1268

6

2743

2799

274

398

1178

452

497

7

5443

5662

936

1026

2166

648

886

8

4342

4522

714

862

1647

538

761

0*

2989

3052

264

496

1300

475

517

Illumina

18264

18264

1913

5060

10137

801

353

Total

62379

64123

9231

13257

27101

6616

7918

* Chr 0 represents the unanchored BACs.


A Confidence Class is assigned for every gene based on the evidence as follows:

F, full coverage/FL-cDNA: The complete gene model from translation start to translation stop is covered by expressed Medicago sequence, e.g. FL-cDNA or EST alignments across the full length of the coding sequence.
E, expressed/EST matches: Expression of the gene is supported by Medicago EST sequence that matches the gene call (partially).
H, homology/heterologous: The gene call is supported by siilarity to Medicago or other ESTs, protein, FL-cDNA, genomic or other sequences with partial of full length alignments.
I, intrinsic/ab initio/inferred/hypothetical: The gene call is based only on intrinsic prediction tools such as FGENESH, Genescan or Eugene, and no significant alignments to other sequences are available.
L, low probability: A very small gene call with less than 100 amino acids without respect to other evidence.
*This classification will be done top-down, so any gene call that does not fall under F will fall under E, and it it does not satisfy the requirements of E, it will be H and all gene calls that do not fulfill H will be called I.