Human Gene CFAP107 (ENST00000614859.5) from GENCODE V44
  Description: Homo sapiens chromosome 1 open reading frame 158 (C1orf158), transcript variant 1, mRNA. (from RefSeq NM_152290)
Gencode Transcript: ENST00000614859.5
Gencode Gene: ENSG00000157330.10
Transcript (Including UTRs)
   Position: hg38 chr1:12,746,200-12,763,699 Size: 17,500 Total Exon Count: 4 Strand: +
Coding Region
   Position: hg38 chr1:12,746,431-12,760,951 Size: 14,521 Coding Exon Count: 4 

Page IndexSequence and LinksPrimersRNA StructureProtein StructureOther Species
mRNA DescriptionsOther NamesMethods
Data last updated at UCSC: 2023-08-18 00:09:47

-  Sequence and Links to Tools and Databases
Genomic Sequence (chr1:12,746,200-12,763,699)mRNA (may differ from genome)Protein (194 aa)
Gene SorterGenome BrowserOther Species FASTATable SchemaAlphaFoldBioGPS
EnsemblEntrez GeneExonPrimerGencodeGeneCardsHPRD
LynxMGIneXtProtPubMedUniProtKBBioGrid CRISPR DB

-  Primer design for this transcript

Primer3Plus can design qPCR Primers that straddle exon-exon-junctions, which amplify only cDNA, not genomic DNA.
Click here to load the transcript sequence and exon structure into Primer3Plus

Exonprimer can design one pair of Sanger sequencing primers around every exon, located in non-genic sequence.
Click here to open Exonprimer with this transcript

To design primers for a non-coding sequence, zoom to a region of interest and select from the drop-down menu: View > In External Tools > Primer3

-  mRNA Secondary Structure of 3' and 5' UTRs
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -46.25231-0.200 Picture PostScript Text
3' UTR -1179.702748-0.429 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  ModBase Predicted Comparative 3D Structure on Q8N1D5
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
Genome BrowserGenome BrowserGenome BrowserNo orthologNo orthologNo ortholog
Gene Details     
Gene Sorter     
Protein SequenceProtein SequenceProtein Sequence   

-  Descriptions from all associated GenBank mRNAs
  BC029894 - Homo sapiens chromosome 1 open reading frame 158, mRNA (cDNA clone MGC:35194 IMAGE:5171302), complete cds.
AK026705 - Homo sapiens cDNA: FLJ23052 fis, clone LNG02660.
AK298760 - Homo sapiens cDNA FLJ56581 complete cds.
JD456261 - Sequence 437285 from Patent EP1572962.
CU688782 - Synthetic construct Homo sapiens gateway clone IMAGE:100016757 5' read C1orf158 mRNA.
HQ447108 - Synthetic construct Homo sapiens clone IMAGE:100070393; CCSB009255_01 chromosome 1 open reading frame 158 (C1orf158) gene, encodes complete protein.
KJ899961 - Synthetic construct Homo sapiens clone ccsbBroadEn_09355 C1orf158 gene, encodes complete protein.
BX647383 - Homo sapiens mRNA; cDNA DKFZp686C0236 (from clone DKFZp686C0236).
JD506222 - Sequence 487246 from Patent EP1572962.
JD441235 - Sequence 422259 from Patent EP1572962.
JD064086 - Sequence 45110 from Patent EP1572962.
JD490158 - Sequence 471182 from Patent EP1572962.
JD512517 - Sequence 493541 from Patent EP1572962.
JD097442 - Sequence 78466 from Patent EP1572962.
JD050846 - Sequence 31870 from Patent EP1572962.
JD364604 - Sequence 345628 from Patent EP1572962.
JD318883 - Sequence 299907 from Patent EP1572962.
JD144009 - Sequence 125033 from Patent EP1572962.
JD189786 - Sequence 170810 from Patent EP1572962.
JD230073 - Sequence 211097 from Patent EP1572962.
JD269639 - Sequence 250663 from Patent EP1572962.
JD366789 - Sequence 347813 from Patent EP1572962.

-  Other Names for This Gene
  Alternate Gene Symbols: C1orf158, CA158_HUMAN, ENST00000614859.1, ENST00000614859.2, ENST00000614859.3, ENST00000614859.4, NM_152290, Q5VUY4, Q8N1D5, uc001auh.1, uc001auh.2, uc001auh.3, uc001auh.4, uc001auh.5
UCSC ID: ENST00000614859.5
RefSeq Accession: NM_152290
Protein: Q8N1D5 (aka CA158_HUMAN)

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.