Schema for Ensembl Genes - Ensembl Genes
|
|
Database: xenTro3 Primary Table: ensGene Row Count: 24,197   Data last updated: 2019-01-11
Format description: A gene prediction with some additional info. On download server: MariaDB table dump directory
field | example | SQL type | info | description |
bin | 585 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
name | ENSXETT00000065882.1 | varchar(255) | values | Name of gene (usually transcript_id from GTF) |
chrom | GL172637 | varchar(255) | values | Reference sequence chromosome or scaffold |
strand | - | char(1) | values | + or - for strand |
txStart | 33 | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
txEnd | 148 | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
cdsStart | 148 | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
cdsEnd | 148 | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
exonCount | 1 | int(10) unsigned | range | Number of exons |
exonStarts | 33, | longblob | | Exon start positions (or end positions for minus strand item) |
exonEnds | 148, | longblob | | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | range | score |
name2 | ENSXETG00000030486.1 | varchar(255) | values | Alternate name (e.g. gene_id from GTF) |
cdsStartStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | -1, | longblob | | Exon frame {0,1,2}, or -1 if no frame for exon |
|
| |
|
|
Connected Tables and Joining Fields
|
|
Sample Rows
|
|
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|
585 | ENSXETT00000065882.1 | GL172637 | - | 33 | 148 | 148 | 148 | 1 | 33, | 148, | 0 | ENSXETG00000030486.1 | none | none | -1, |
585 | ENSXETT00000061796.1 | GL172637 | - | 605 | 720 | 720 | 720 | 1 | 605, | 720, | 0 | ENSXETG00000031766.1 | none | none | -1, |
585 | ENSXETT00000065862.1 | GL172637 | + | 20028 | 25277 | 20028 | 25212 | 2 | 20028,24513, | 20172,25277, | 0 | ENSXETG00000001053.3 | incmpl | cmpl | 0,0, |
585 | ENSXETT00000002304.3 | GL172637 | + | 20061 | 25277 | 20061 | 25212 | 3 | 20061,20263,24513, | 20176,20265,25277, | 0 | ENSXETG00000001053.3 | incmpl | cmpl | 0,1,0, |
73 | ENSXETT00000065636.1 | GL172637 | - | 57442 | 207405 | 59436 | 207405 | 15 | 57442,63050,64382,74344,75918,76176,76539,86918,87876,91226,96838,106424,122799,175875,206783, | 59508,63125,64491,74423,75974,76263,76636,87048,87920,91269,96934,106470,122929,175881,207405, | 0 | ENSXETG00000001054.3 | cmpl | cmpl | 0,0,2,1,2,2,1,0,1,0,0,2,1,1,0, |
585 | ENSXETT00000002307.2 | GL172637 | - | 57442 | 91597 | 59436 | 91485 | 11 | 57442,63050,64382,74344,75918,76176,76539,86918,87798,91226,91449, | 59508,63125,64491,74423,75974,76263,76636,87048,87920,91269,91597, | 0 | ENSXETG00000001054.3 | cmpl | cmpl | 0,0,2,1,2,2,1,0,1,0,0, |
585 | ENSXETT00000059679.1 | GL172637 | - | 95521 | 95679 | 95679 | 95679 | 1 | 95521, | 95679, | 0 | ENSXETG00000029479.1 | none | none | -1, |
73 | ENSXETT00000061874.1 | GL172637 | - | 116563 | 215329 | 116760 | 207405 | 4 | 116563,122799,206780,215261, | 116803,122929,207516,215329, | 0 | ENSXETG00000001054.3 | cmpl | cmpl | 2,1,0,-1, |
73 | ENSXETT00000066017.1 | GL172637 | + | 272961 | 446773 | 272961 | 446773 | 9 | 272961,291361,299478,301570,321877,322105,435374,446521,446690, | 273255,291478,299538,301662,321971,322130,435413,446536,446773, | 0 | ENSXETG00000030397.1 | cmpl | cmpl | 0,0,0,0,2,0,1,1,1, |
588 | ENSXETT00000002310.2 | GL172637 | - | 463007 | 495412 | 463456 | 494069 | 11 | 463007,472734,474142,482057,484803,486452,487908,491398,492665,493967,495379, | 463516,473021,474215,482129,484989,486620,488067,491594,492757,494077,495412, | 0 | ENSXETG00000001057.2 | cmpl | cmpl | 0,1,0,0,0,0,0,2,0,0,-1, |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
Ensembl Genes (ensGene) Track Description
|
|
Description
These gene predictions were generated by Ensembl.
For more information on the different gene tracks, see our Genes FAQ.
Methods
For a description of the methods used in Ensembl gene predictions, please refer to
Hubbard et al. (2002), also listed in the References section below.
Data access
Ensembl Gene data can be explored interactively using the
Table Browser or the
Data Integrator.
For local downloads, the genePred format files for xenTro3 are available in our
downloads directory as ensGene.txt.gz or in our
genes download directory in GTF format.
For programmatic access, the data can be queried from the
REST API or
directly from our public MySQL
servers. Instructions on this method are available on our
MySQL help page and on
our blog.
Previous versions of this track can be found on our archive download server.
Credits
We would like to thank Ensembl for providing these gene annotations. For more information, please see
Ensembl's genome annotation page.
References
Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J,
Curwen V, Down T et al.
The Ensembl genome database project.
Nucleic Acids Res. 2002 Jan 1;30(1):38-41.
PMID: 11752248; PMC: PMC99161
| |
|
|
|