Normal Clinical Tissue Alliance: Data Format

The formats of the tables that record the tissue information are given below. Any changes to how the data is stored will also be listed here.

Table Names

All tissue data is contained in fifteen tables. A summary table called BTO_ids contains summary information about each release, such as the release number, release date and a short description. The other fourteen tables correspond to the fourteen tissue types and are named accordingly.

Data Format

The tables listed below have the following columns, in the order presented.




BTO_ids table

Column Name     Data Type                    Notes
BTO_id          text, < 256 characters       Always Present
release_id      non-negative integer         Always Present
release_date    date as YYYY-MM-DD           Always Present
tissue_type     text, <= 50 characters       Always Present
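
As an illustration of the BTO_ids column rules above, here is a minimal Python sketch that checks one row against them. The function name check_bto_ids_row and the dict-based row representation are assumptions made for this example and are not part of the data release. The sketch also assumes that "Always Present" means the value is never NULL.

```python
import re
from datetime import date

# Hypothetical helper: checks one BTO_ids row supplied as a dict.
# Assumption: "Always Present" means the value is never missing/None.
def check_bto_ids_row(row):
    """Return a list of problems found in the row (empty if none)."""
    problems = []

    for column in ("BTO_id", "release_id", "release_date", "tissue_type"):
        if row.get(column) is None:
            problems.append(f"{column}: missing")
    if problems:
        return problems

    # BTO_id: text shorter than 256 characters.
    if not isinstance(row["BTO_id"], str) or len(row["BTO_id"]) >= 256:
        problems.append("BTO_id: expected text of fewer than 256 characters")

    # release_id: non-negative integer (bool is an int subclass, so exclude it).
    rid = row["release_id"]
    if isinstance(rid, bool) or not isinstance(rid, int) or rid < 0:
        problems.append("release_id: expected a non-negative integer")

    # release_date: strictly YYYY-MM-DD and a real calendar date.
    rdate = row["release_date"]
    date_ok = isinstance(rdate, str) and re.fullmatch(r"\d{4}-\d{2}-\d{2}", rdate)
    if date_ok:
        try:
            date.fromisoformat(rdate)
        except ValueError:
            date_ok = False
    if not date_ok:
        problems.append("release_date: expected a date as YYYY-MM-DD")

    # tissue_type: text of at most 50 characters.
    if not isinstance(row["tissue_type"], str) or len(row["tissue_type"]) > 50:
        problems.append("tissue_type: expected text of at most 50 characters")

    return problems
```

A row that satisfies all four rules yields an empty list. Each violated rule adds one message, so the result can be logged or shown directly.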




All tissue tables

Column Name     Data Type                    Notes
ENSEMBL         text, <= 40 characters       Always Present
release_id      non-negative integer         Always Present
HGNC            text, <= 30 characters       Always Present
Entrez_ID       non-negative integer
Uniprot         text, <= 15 characters       Always Present
description     text                         Always Present
min_expect      floating point number        Always Present
max_expect      floating point number        Always Present
count           non-negative integer         Always Present
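
The tissue table layout above could be expressed as a database schema along the following lines. This is only a sketch using Python's built-in sqlite3 module, not the schema actually used for the release. The table name example_tissue and the function make_example_db are placeholders, and the database engine is an assumption. Entrez_ID is treated as optional because it is the only column not marked "Always Present".

```python
import sqlite3

# "example_tissue" is a placeholder name; the real tables are named
# after the tissue types, which are not listed on this page.
TISSUE_TABLE_SQL = """
CREATE TABLE example_tissue (
    ENSEMBL     TEXT    NOT NULL CHECK (length(ENSEMBL) <= 40),
    release_id  INTEGER NOT NULL CHECK (release_id >= 0),
    HGNC        TEXT    NOT NULL CHECK (length(HGNC) <= 30),
    -- Assumption: Entrez_ID may be absent, since it has no note.
    Entrez_ID   INTEGER          CHECK (Entrez_ID IS NULL OR Entrez_ID >= 0),
    Uniprot     TEXT    NOT NULL CHECK (length(Uniprot) <= 15),
    description TEXT    NOT NULL,
    min_expect  REAL    NOT NULL,
    max_expect  REAL    NOT NULL,
    "count"     INTEGER NOT NULL CHECK ("count" >= 0)
)
"""

def make_example_db():
    """Create an in-memory database holding one empty tissue table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(TISSUE_TABLE_SQL)
    return conn
```

With this schema, an insert that omits an "Always Present" column or supplies a negative integer raises sqlite3.IntegrityError, so malformed rows are rejected when they are loaded.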