Normal Clinical Tissue Alliance: Data Format


Revision as of 18:45, 29 February 2008

This page gives the formats of the tables that record the tissue information. Any changes to how the data is stored will also be listed here.

== Table Names ==

All tissue data is contained in a summary table and fourteen tissue tables. The summary table, called BTO_ids, contains summary information about each release of information, such as release number, release date and tissue type. The fourteen tissue tables correspond to the fourteen tissue types and are named accordingly.

== Data Format ==

The tables listed below have the following columns, in the order presented.




{|border="1" cellspacing="0" cellpadding="3" align="center"
|+'''BTO_ids''' table
!Column Name
!Data Type
!Notes
|-
|BTO_id
|Text < 256 characters
|Always Present
|-
|release_id
|non-negative integer
|Always Present
|-
|release_date
|date as YYYY-MM-DD
|Always Present
|-
|tissue_type
|Text <= 50 characters
|Always Present
|}
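
The column constraints of the BTO_ids table can be checked when rows are read. The following is a minimal Python sketch, not part of any GPM tooling: the class name `BtoIdRow`, the function `parse_bto_id_row` and the sample values are hypothetical, and treating "Always Present" as "non-empty" is an assumption.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class BtoIdRow:
    """One row of the BTO_ids table (hypothetical class name)."""
    bto_id: str          # Text < 256 characters
    release_id: int      # non-negative integer
    release_date: date   # stored as YYYY-MM-DD
    tissue_type: str     # Text <= 50 characters


def parse_bto_id_row(fields):
    """Validate four raw string fields, given in table column order."""
    bto_id, release_id, release_date, tissue_type = fields
    # Assumption: "Always Present" means the value is never empty.
    if not 0 < len(bto_id) < 256:
        raise ValueError("BTO_id must be 1 to 255 characters")
    if not 0 < len(tissue_type) <= 50:
        raise ValueError("tissue_type must be 1 to 50 characters")
    release = int(release_id)
    if release < 0:
        raise ValueError("release_id must be non-negative")
    # date.fromisoformat accepts the YYYY-MM-DD form used by release_date.
    return BtoIdRow(bto_id, release, date.fromisoformat(release_date), tissue_type)


# Illustrative values only; they are not taken from a real release.
row = parse_bto_id_row(["BTO:0000759", "1", "2008-02-29", "liver"])
print(row)
```

A malformed date or a negative release number raises `ValueError`, so bad rows are rejected before they reach later processing.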




{|border="1" cellspacing="0" cellpadding="3" align="center"
|+All '''tissue''' tables
!Column Name
!Data Type
!Notes
|-
|ENSEMBL
|text <= 40 characters
|Always Present
|-
|release_id
|non-negative integer
|Always present
|-
|HGNC
|text <= 30 characters
|Always Present
|-
|Entrez_ID
|non-negative integer
|
|-
|Uniprot
|text <= 15 characters
|Always Present
|-
|description
|text
|Always Present
|-
|min_expect
|floating point number
|Always Present
|-
|max_expect
|floating point number
|Always Present
|-
|count
|non-negative integer
|Always Present
|}
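
To illustrate the column order and types of a tissue table, here is a hedged sketch that mirrors them in an in-memory SQLite database using Python's standard `sqlite3` module. This page does not say which database engine holds the data, so SQLite is an assumption made only for illustration. The table name `liver` and the inserted values are made up, and Entrez_ID is left nullable because its Notes cell is empty.

```python
import sqlite3

# Hypothetical table name; the page says only that tissue tables are
# named after their tissue types.
TABLE = "liver"

conn = sqlite3.connect(":memory:")
conn.execute(f"""
    CREATE TABLE {TABLE} (
        ENSEMBL     TEXT    NOT NULL CHECK (length(ENSEMBL) <= 40),
        release_id  INTEGER NOT NULL CHECK (release_id >= 0),
        HGNC        TEXT    NOT NULL CHECK (length(HGNC) <= 30),
        Entrez_ID   INTEGER CHECK (Entrez_ID >= 0),  -- nullable: no note given
        Uniprot     TEXT    NOT NULL CHECK (length(Uniprot) <= 15),
        description TEXT    NOT NULL,
        min_expect  REAL    NOT NULL,
        max_expect  REAL    NOT NULL,
        "count"     INTEGER NOT NULL CHECK ("count" >= 0)
    )
""")

# Made-up row, with values given in the column order presented above.
row = ("ENSP00000000001", 1, "EXAMPLE1", None, "P00000",
       "made-up protein", -12.5, -3.1, 7)
conn.execute(f"INSERT INTO {TABLE} VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", row)

# Look up the entries that belong to one release.
found = conn.execute(
    f'SELECT ENSEMBL, "count" FROM {TABLE} WHERE release_id = ?', (1,)
).fetchall()
print(found)
```

Because every tissue table has the same columns, the same statements would apply to any of them by changing `TABLE`. The `release_id` column is what ties a row back to a release listed in BTO_ids.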