Schema for Ensembl Genes

Home
Genomes
Genome Browser
Tools
Mirrors
- Euro/Asia Mirrors
- Mirroring Instructions
- US Server
- European Server
- Asian Server
Downloads
My Data
Projects
Help
About Us
- News
- Publications
- Blog
- Cite Us
- Credits
- Release Log
- Staff
- Conditions of Use
- Our History
- Jobs
- Licenses
- Contact Us

field

example

SQL type

info

description

bin

585

smallint(5) unsigned

range

Indexing field to speed chromosome range queries.

name

ENSDNOT00000038925.1

varchar(255)

values

Name of gene (usually transcript_id from GTF)

chrom

AAGV03000462

varchar(255)

values

Reference sequence chromosome or scaffold

strand

char(1)

values

+ or - for strand

txStart

665

int(10) unsigned

range

Transcription start position (or end position for minus strand item)

txEnd

1525

int(10) unsigned

range

Transcription end position (or start position for minus strand item)

cdsStart

665

int(10) unsigned

range

Coding region start (or end position for minus strand item)

cdsEnd

1525

int(10) unsigned

range

Coding region end (or start position for minus strand item)

exonCount

int(10) unsigned

range

Number of exons

exonStarts

665,1151,1506,

longblob

Exon start positions (or end positions for minus strand item)

exonEnds

720,1392,1525,

longblob

Exon end positions (or start positions for minus strand item)

score

int(11)

range

score

name2

ENSDNOG00000040597.1

varchar(255)

values

Alternate name (e.g. gene_id from GTF)

cdsStartStat

cmpl

enum('none', 'unk', 'incmpl', 'cmpl')

values

Status of CDS start annotation (none, unknown, incomplete, or complete)

cdsEndStat

incmpl

enum('none', 'unk', 'incmpl', 'cmpl')

values

Status of CDS end annotation (none, unknown, incomplete, or complete)

exonFrames

0,1,2,

longblob

Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region.

      dasNov3.ensGtp.transcript (via ensGene.name)
      dasNov3.ensPep.name (via ensGene.name)
      dasNov3.ensemblSource.name (via ensGene.name)
      dasNov3.ensemblToGeneName.name (via ensGene.name)
      knownGeneV39.knownToEnsembl.value (via ensGene.name)

bin

name

chrom

strand

txStart

txEnd

cdsStart

cdsEnd

exonCount

exonStarts

exonEnds

score

name2

cdsStartStat

cdsEndStat

exonFrames

585

ENSDNOT00000038925.1

AAGV03000462

665

1525

665

1525

665,1151,1506,

720,1392,1525,

ENSDNOG00000040597.1

cmpl

incmpl

0,1,2,

585

ENSDNOT00000032866.1

AAGV03001531

1269

2,599,659,942,

211,657,938,1269,

ENSDNOG00000040886.1

incmpl

cmpl

1,0,0,0,

585

ENSDNOT00000054944.1

AAGV03002204

1190

20,834,

270,1190,

ENSDNOG00000049893.1

none

-1,-1,

585

ENSDNOT00000043187.1

AAGV03003272

996

55,120,197,842,

118,182,752,996,

ENSDNOG00000042161.1

incmpl

cmpl

0,1,1,0,

585

ENSDNOT00000049509.1

AAGV03003599

296

29,58,134,

56,130,296,

ENSDNOG00000037889.1

none

-1,-1,-1,

585

ENSDNOT00000038859.1

AAGV03004835

640

748

640,

748,

ENSDNOG00000044719.1

none

-1,

585

ENSDNOT00000053939.1

AAGV03005238

1231

20,626,

241,1231,

ENSDNOG00000052549.1

none

-1,-1,

585

ENSDNOT00000035300.1

AAGV03005904

941

1105

941,

1105,

ENSDNOG00000032163.1

none

-1,

585

ENSDNOT00000006480.2

AAGV03006121

169

598

169

598

169,

598,

ENSDNOG00000006485.2

incmpl

585

ENSDNOT00000040232.1

AAGV03006342

139

724

139

724

139,389,

206,724,

ENSDNOG00000042773.1

incmpl

0,1,

Description

These gene predictions were generated by Ensembl.

For more information on the different gene tracks, see our Genes FAQ.

Methods

For a description of the methods used in Ensembl gene predictions, please refer to Hubbard et al. (2002), also listed in the References section below.

Data access

Ensembl Gene data can be explored interactively using the Table Browser or the Data Integrator. For local downloads, the genePred format files for dasNov3 are available in our downloads directory as ensGene.txt.gz or in our genes download directory in GTF format.

For programmatic access, the data can be queried from the REST API or directly from our public MySQL servers. Instructions on this method are available on our MySQL help page and on our blog.

Previous versions of this track can be found on our archive download server.

Credits

We would like to thank Ensembl for providing these gene annotations. For more information, please see Ensembl's genome annotation page.

References

Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T et al. The Ensembl genome database project. Nucleic Acids Res. 2002 Jan 1;30(1):38-41. PMID: 11752248; PMC: PMC99161