Normal Clinical Tissue Alliance: Data Format

From TheGPMWiki
Revision as of 23:44, 13 February 2008 by WikiSysop

The formats of the tables that record tissue information are given below. Any changes to how the data is stored will also be listed here.

Table Names

All tissue data is contained in fifteen tables. One table, process_ids, is not used directly by information consumers but is included below for reference. A summary table, BTO_ids, holds information about each release, such as the release number, release date and a short description. The other thirteen tables correspond to the thirteen BTO numbers and are named accordingly.

Data Format

The tables listed below have the following columns, in the order presented.

process_ids table

Column Name      Data Type                        Notes
proc_id          non-negative integer             Always Present
description      text <= 512 characters           Always Present




BTO_ids table

Column Name      Data Type                        Notes
BTO_id           text < 256 characters            Always Present
processing_id    non-negative integer < 65,535    Always Present
release_id       non-negative integer             Always Present
release_date     date as YYYY-MM-DD               Always Present
description      text < 256 characters            Always Present
tissue_type      text <= 50 characters            Always Present
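
The column limits for the BTO_ids table can be checked in code before a row is loaded. The following is a minimal Python sketch, not part of the original specification: the function name validate_bto_ids_row, the dictionary-based row layout, and the sample values are all hypothetical, and "< 256 characters" is read as "at most 255 characters", which is an assumption.

```python
import re
from datetime import datetime

# Maximum lengths for the text columns of BTO_ids.
# Assumption: "< 256 characters" is read as "at most 255".
TEXT_LIMITS = {"BTO_id": 255, "description": 255, "tissue_type": 50}


def _is_non_negative_int(value):
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool) and value >= 0


def validate_bto_ids_row(row):
    """Return a list of problems found in one BTO_ids row (empty if none)."""
    problems = []

    # Every column is marked "Always Present", so a missing value is an error.
    for column, limit in TEXT_LIMITS.items():
        value = row.get(column)
        if not isinstance(value, str):
            problems.append(f"{column}: missing or not text")
        elif len(value) > limit:
            problems.append(f"{column}: longer than {limit} characters")

    processing_id = row.get("processing_id")
    if not _is_non_negative_int(processing_id) or processing_id >= 65535:
        problems.append("processing_id: not a non-negative integer < 65,535")

    if not _is_non_negative_int(row.get("release_id")):
        problems.append("release_id: not a non-negative integer")

    release_date = row.get("release_date")
    if not isinstance(release_date, str) or not re.fullmatch(
        r"\d{4}-\d{2}-\d{2}", release_date
    ):
        problems.append("release_date: not in YYYY-MM-DD form")
    else:
        try:
            datetime.strptime(release_date, "%Y-%m-%d")
        except ValueError:
            problems.append("release_date: not a real calendar date")

    return problems


# Invented sample row, for illustration only.
sample = {
    "BTO_id": "BTO:0000000",
    "processing_id": 1,
    "release_id": 1,
    "release_date": "2008-02-13",
    "description": "example release",
    "tissue_type": "example tissue",
}
print(validate_bto_ids_row(sample))
```

Returning a list of problems rather than raising on the first one lets a loader report every fault in a row at once.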




All BTO Number tables

Column Name      Data Type                        Notes
ENSEMBL          text <= 40 characters            Always Present
release_id       non-negative integer             Always Present
processing_id    non-negative integer < 65,535    Always Present
HGNC             text <= 30 characters            Always Present
Entrez_ID        non-negative integer
Uniprot          text <= 15 characters            Always Present
description      text                             Always Present
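
As an illustration of the BTO Number table layout, the sketch below declares the columns in an in-memory SQLite database using Python's standard sqlite3 module. Several things here are assumptions rather than part of the specification: SQLite stands in for whatever database actually holds the data, the table name bto_example is hypothetical (the real tables are named after their BTO numbers), Entrez_ID is treated as optional because it carries no "Always Present" note, and the sample rows are invented.

```python
import sqlite3

# Columns in the documented order, with the documented limits as CHECKs.
# Assumption: Entrez_ID may be NULL, since it is not marked "Always Present".
DDL = """
CREATE TABLE bto_example (
    ENSEMBL       TEXT    NOT NULL CHECK (length(ENSEMBL) <= 40),
    release_id    INTEGER NOT NULL CHECK (release_id >= 0),
    processing_id INTEGER NOT NULL
                  CHECK (processing_id >= 0 AND processing_id < 65535),
    HGNC          TEXT    NOT NULL CHECK (length(HGNC) <= 30),
    Entrez_ID     INTEGER CHECK (Entrez_ID IS NULL OR Entrez_ID >= 0),
    Uniprot       TEXT    NOT NULL CHECK (length(Uniprot) <= 15),
    description   TEXT    NOT NULL
)
"""


def make_connection():
    """Create an in-memory database containing the example table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(DDL)
    return conn


def try_insert(conn, row):
    """Insert one row; return False if a constraint rejects it."""
    try:
        # The connection context manager commits on success
        # and rolls back if the statement raises.
        with conn:
            conn.execute(
                "INSERT INTO bto_example VALUES (?, ?, ?, ?, ?, ?, ?)", row
            )
        return True
    except sqlite3.IntegrityError:
        return False


conn = make_connection()

# Invented rows, for illustration only.
ok_row = ("ENSP00000000000", 1, 1, "EXAMPLE1", None, "P00000", "example protein")
bad_row = ("ENSP00000000001", 1, 70000, "EXAMPLE2", 123, "P00001", "example protein")

print(try_insert(conn, ok_row))   # accepted: all limits satisfied
print(try_insert(conn, bad_row))  # rejected: processing_id is not < 65,535
```

The CHECK constraints cover only the length and range limits; SQLite's flexible typing means they do not by themselves guarantee that every value has the declared type.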