Technical Overview, protein table

From TheGPMWiki
(Difference between revisions)
Jump to: navigation, search
(ZiXBebJWdAyIytlfY)
m (Reverted edits by 94.23.1.28 (Talk) to last revision by WikiSysop)
Line 1: Line 1:
-
Why is anyone tanlikg about pathetic failure of religous beliefs? You might as well wallow in the dirt like some ignorant savages. The time for being controlled slaves is 100 years ago. So not fall victim to magic and bullshit.No-religion is the one defining thing that the red communists got correct!
+
The ''protein'' table records the details for each protein identification in a GPM [[Technical Overview, result table|result]] file.  Each entry in the ''protein'' table corresponds to a single protein from a single file.  If a file contained no identifications, the corresponding resultid for that file will not appear in this table.
 +
 
 +
The contents of this table are updated incrementally as part of the [[Technical Overview, GPMDB Updates|update]] procedure which executes daily.
 +
 
 +
Columns:
 +
*'''proid''': the unique identifier for a single identification for a single protein.
 +
*'''resultid''': the unique identifier for the result file in which this protein identification occurred.  All entries in the ''protein'' table from a single file will have the same value.
 +
*'''proseqid''': the unique identifier for the protein sequence associated with this identification. All identifications of a given protein will have the same value.
 +
*'''expect''': the expectation score; a floating point (decimal) number which is the base-10 logarithm of the odds that this particular identification of this protein was a chance occurrence (i.e., a false positive).  If the odds of this identification are 1 in 1000, the expect score would be -3.0.
 +
*'''pida''': the 1-based sequence of this mass spectrum in the original data file.
 +
*'''pidb''': the identifier for this particular identification.
 +
*'''uid''': a unique identifier for this protein, which is determined by the search software.

Revision as of 18:53, 25 August 2012

The protein table records the details for each protein identification in a GPM result file. Each entry in the protein table corresponds to a single protein from a single file. If a file contained no identifications, the corresponding resultid for that file will not appear in this table.

The contents of this table are updated incrementally as part of the update procedure which executes daily.

Columns:

  • proid: the unique identifier for a single identification for a single protein.
  • resultid: the unique identifier for the result file in which this protein identification occurred. All entries in the protein table from a single file will have the same value.
  • proseqid: the unique identifier for the protein sequence associated with this identification. All identifications of a given protein will have the same value.
  • expect: the expectation score; a floating point (decimal) number which is the base-10 logarithm of the odds that this particular identification of this protein was a chance occurrence (i.e., a false positive). If the odds of this identification are 1 in 1000, the expect score would be -3.0.
  • pida: the 1-based sequence of this mass spectrum in the original data file.
  • pidb: the identifier for this particular identification.
  • uid: a unique identifier for this protein, which is determined by the search software.
Personal tools