FAM200A

Protein-coding gene in the species Homo sapiens

C7orf38 (also known as FAM200A) is a gene located on chromosome 7 in the human genome. It is expressed at very low levels in nearly all tissue types. Evolutionarily, it is found throughout the kingdom Animalia. Although the function of the protein is not fully understood, bioinformatic tools indicate that it bears considerable similarity to zinc finger and transposase proteins. Many of its orthologs, paralogs, and neighboring genes possess zinc finger domains. The protein contains a hAT dimerization domain near its C-terminus; this domain is highly conserved in transposase enzymes.

Gene

C7orf38 is located on chromosome 7 at q22.1. Its genomic sequence contains 5,612 bp. The predominant transcript contains two exons and is 2,507 bp in length. The translated protein contains 573 amino acids.
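The lengths quoted above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming a single stop codon, that the 2,507 bp transcript is the mature mRNA, and that the 5,612 bp genomic sequence spans exactly the exons plus intervening sequence. The function name and the derived totals are illustrative and are not values reported by the sources.

```python
def coding_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of the given length."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values stated in the article.
GENOMIC_BP = 5612
TRANSCRIPT_BP = 2507
PROTEIN_AA = 573

cds_nt = coding_length_nt(PROTEIN_AA)        # coding sequence incl. stop codon
untranslated_nt = TRANSCRIPT_BP - cds_nt     # combined 5' and 3' UTR (assumption)
intronic_bp = GENOMIC_BP - TRANSCRIPT_BP     # assumes gene span = exons + intron

print(cds_nt, untranslated_nt, intronic_bp)  # 1722 785 3105
```

Under these assumptions, roughly two thirds of the transcript is coding sequence, and the remainder is untranslated.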

C7orf38 Gene Loci.

Protein composition

The 573-amino-acid protein has a molecular weight of 66,280.05 Da. Its isoelectric point is at pH 5.775, about 1.6 pH units lower than the average human pH. Two deviations from prototypical human proteins are evident: the protein contains fewer glycine residues than expected and is rich in leucine residues. There are no regions of strong hydrophobicity or hydrophilicity, so it is not predicted to be a transmembrane protein.

Hydrophilicity Analysis.
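The reasoning behind the composition and hydrophobicity statements can be illustrated with a short sketch. It is not the SAPS analysis used for the article. It assumes the Kyte-Doolittle hydropathy scale with a 19-residue window and the commonly quoted 1.6 cutoff for candidate membrane-spanning segments. Both toy sequences are hypothetical and are not the C7orf38 sequence.

```python
from collections import Counter

# Kyte-Doolittle hydropathy index (positive values are hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def composition(seq: str) -> dict:
    """Fraction of each residue type present in the sequence."""
    counts = Counter(seq)
    return {aa: counts[aa] / len(seq) for aa in sorted(counts)}


def hydropathy_profile(seq: str, window: int = 19) -> list:
    """Mean hydropathy of every full-length sliding window."""
    scores = [KD[aa] for aa in seq]
    return [sum(scores[i:i + window]) / window
            for i in range(len(seq) - window + 1)]


def has_candidate_tm_segment(seq: str, window: int = 19,
                             threshold: float = 1.6) -> bool:
    """True if any window exceeds the hydropathy threshold."""
    return any(v > threshold for v in hydropathy_profile(seq, window))


# Hypothetical toy sequences, for illustration only.
toy_soluble = "MSEEKLDKLRELQEKLDSGNTKRELEDLK" * 2
toy_membrane = "MK" + "LIVALLVIAGLLAVILFLV" + "KRE"

print(has_candidate_tm_segment(toy_soluble))   # False
print(has_candidate_tm_segment(toy_membrane))  # True
print(composition(toy_soluble)["L"], composition(toy_soluble).get("G", 0.0))
```

A profile in which no window crosses the threshold, as reported for C7orf38, is the kind of result that argues against a membrane-spanning helix. The leucine and glycine fractions from `composition` would be compared with proteome-wide averages to flag the deviations described above.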

Gene neighborhood

The four genes in closest proximity to C7orf38 on chromosome 7 have similar functions: all encode zinc finger proteins, and several are implicated in transcriptional regulation.

Gene Neighborhood
Name | Start position (bp from pter) | Function
ZNF789 | 98,908,451 | Encodes zinc finger protein 789. The gene has been proposed to participate in the regulation of transcription and is predicted to bind zinc ions.
ZNF394 | 98,928,790 | Encodes zinc finger protein 394. Overexpression of ZNF394 inhibits the transcription of c-Jun and AP-1, suggesting that it is a transcriptional repressor.
ZKSCAN5 | 98,940,209 | Encodes zinc finger with KRAB and SCAN domains 5, a zinc finger protein of the Kruppel family. The protein contains a SCAN box and a KRAB A domain.
ZNF655 | 98,993,981 | Encodes zinc finger protein 655. Numerous alternatively spliced transcripts encoding distinct isoforms have been discovered.
Mihuya | 99,149,738 | Does not encode a large or known functional protein. Its antisense orientation relative to C7orf38 raises the possibility that it regulates C7orf38 expression.

Paralogs

Eight paralogs are found in the human proteome. Like the neighboring genes, many of the paralogs encode zinc finger proteins or transcription factors.

Name | NCBI accession number | Length (AA) | % Identity to C7orf38 | % Similarity to C7orf38
hypothetical protein LOC285550 | NP_001138663.1 | 657 | 79 | 91
zinc finger MYM-type protein 6 | NP_009098.3 | 1325 | 38 | 60
SCAN domain-containing protein 3 | NP_443155.1 | 1325 | 39 | 60
zinc finger BED domain-containing protein 5 | NP_067034.2 | 692 | 35 | 57
transposon-derived Buster3 transposase-like | NP_071373.2 | 594 | 32 | 53
general transcription factor II-I repeat domain-containing protein 2B | NP_001003795.1 | 949 | 25 | 46
GTF2I repeat domain containing 2 | NP_775808.2 | 949 | 24 | 45
EPM2A interacting protein 1 | NP_055620.1 | 607 | 22 | 42

Orthologs

Orthologs of C7orf38 can be traced back evolutionarily as far as plants. The following is not an exhaustive list of orthologs; it is intended to provide an evolutionary overview of the conservation of C7orf38.

Common name | Genus & species | NCBI accession number | Length (AA) | % Identity to C7orf38 | % Similarity to C7orf38
Chimp | Pan troglodytes | XP_001139775.1 | 573 | 99 | 99
Macaque monkey | Macaca fascicularis | BAE01234.1 | 573 | 96 | 98
Horse | Equus caballus | XP_001915370.1 | 573 | 81 | 84
Pig | Sus scrofa | XP_001929194 | 1323 | 39 | 61
Cow | Bos taurus | XP_875656.2 | 1320 | 38 | 61
Mouse | Mus musculus | CAM15594.1 | 1157 | 37 | 60
Domestic dog | Canis lupus familiaris | ABF22701.1 | 609 | 37 | 60
Rat | Rattus rattus | NP_001102151.1 | 1249 | 37 | 59
Opossum | Monodelphis domestica | XP_001372983.1 | 608 | 37 | 59
Chicken | Gallus gallus | XP_424913.2 | 641 | 37 | 58
Frog | Xenopus (Silurana) tropicalis | ABF20551.1 | 656 | 37 | 56
Zebrafish | Danio rerio | XP_001340213.1 | 609 | 37 | 56
Pea aphid | Acyrthosiphon pisum | XP_001943527.1 | 659 | 36 | 54
Beetle | Tribolium castaneum | ABF20545.1 | 599 | 35 | 55
Sea squirt | Ciona intestinalis | XP_002119512.1 | 524 | 34 | 52
Hydra | Hydra magnipapillata | XP_002165429.1 | 572 | 29 | 52
Puffer fish | Tetraodon nigroviridis | CAF95678.1 | 539 | 28 | 47
Mosquito | Anopheles gambiae | XP_558399.5 | 591 | 28 | 47
Sea urchin | Strongylocentrotus purpuratus | ABF20546.1 | 625 | 27 | 47
Grass plant | Sorghum bicolor | XP_002439156.1 | 524 | 25 | 40
Broad leaf tree | Populus trichocarpa | XP_002319808.1 | 788 | 21 | 39
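The identity and similarity columns in the table above summarize pairwise alignments. As a rough illustration of how such percentages are derived, the sketch below scores two pre-aligned sequences. It makes three assumptions: the denominator is the full alignment length (gapped columns included), similarity is judged by hypothetical conservative-substitution groups rather than a substitution matrix such as BLOSUM62, and the two toy sequences are invented for the example.

```python
# Hypothetical groups of residues treated as "similar" for this sketch;
# alignment tools normally use a substitution matrix instead.
SIMILAR_GROUPS = ["ILVM", "FYW", "KRH", "DE", "NQ", "ST", "AG"]


def _group(aa: str) -> str:
    """Return the similarity group of a residue, or the residue itself."""
    for group in SIMILAR_GROUPS:
        if aa in group:
            return group
    return aa


def identity_and_similarity(aln_a: str, aln_b: str) -> tuple:
    """Percent identity and similarity over all alignment columns."""
    if len(aln_a) != len(aln_b) or not aln_a:
        raise ValueError("aligned sequences must be non-empty and equal length")
    identical = 0
    similar = 0
    for a, b in zip(aln_a, aln_b):
        if a == "-" or b == "-":
            continue  # gapped columns count toward length but never match
        if a == b:
            identical += 1
        if _group(a) == _group(b):
            similar += 1  # identical residues also count as similar
    n = len(aln_a)
    return 100 * identical / n, 100 * similar / n


# Toy alignment, for illustration only ('-' marks a gap).
ident, sim = identity_and_similarity("MKLVE-DST", "MRLIEADSS")
print(round(ident, 1), round(sim, 1))  # 55.6 88.9
```

Because identical residues are also counted as similar, similarity is never lower than identity, which matches the pattern in every row of the table.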

Structure

Protein

CBLAST was used to identify a structurally related protein with an experimentally determined structure. Hermes DNA transposase, of the Hermes DBD superfamily, was found to be structurally similar (E-value: 1E-6). The hAT dimerization domain is found at the C-terminus of transposases from elements belonging to the Activator superfamily (hAT element superfamily). The isolated dimerization domain forms extremely stable dimers in vitro.

Hermes DNA Transposase.

mRNA

The MFOLD program, available at the Rensselaer BioInformatics Server, was used to predict the secondary structure of the mature mRNA sequence. The regions of primary sequence that form the predicted secondary structures are highly conserved among orthologs, suggesting structural importance.

MFOLD Secondary Structure Prediction.
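MFOLD predicts secondary structure by minimizing folding free energy with experimentally derived thermodynamic parameters. The sketch below does not reproduce that method. It swaps in the much simpler Nussinov dynamic program, which only maximizes the number of nested base pairs, to show the general idea of computing a fold from sequence. The function name, the minimum hairpin loop of three nucleotides, and the toy sequences are assumptions made for the example.

```python
# Watson-Crick pairs plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def max_base_pairs(rna: str, min_loop: int = 3) -> int:
    """Nussinov recursion: maximum number of nested base pairs.

    min_loop is the minimum number of unpaired bases enclosed by a pair.
    """
    n = len(rna)
    if n == 0:
        return 0
    # dp[i][j] holds the best pair count for the subsequence rna[i..j].
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: position j is unpaired
            for k in range(i, j - min_loop):  # case 2: j pairs with k
                if (rna[k], rna[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


# Toy hairpin: a three-pair stem closing an AAA loop.
print(max_base_pairs("GGGAAAUCC"))  # 3
```

Energy-based tools replace the simple pair count with stacking and loop energies, so their predicted structures can differ substantially from a pair-maximizing fold.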

Tissue distribution

The gene appears to be expressed in most tissue types. EST profiles show very low levels of expression, with no differences observed across health states or developmental stages.

EST profile based on tissue type.
EST profile based on health state.
EST profile based on developmental stage.


This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.
