ProteoRed - Instituto Nacional de Protemica Jul 30th, 2010 
ProteoRed Multi-centric Experiment 6 page


Last update: 05-Ene-2010     
Databases - PME 5

Back to Data Analysis section
     
     

Here you can download the database(s) that you will have to use in your searches for the experiment.

The original database is composed by all proteins from ECOLI extracted from the Swiss-Prot database, and also contains the sequences from the four spiked proteins (see Sample description page). In total = 4363 + 4 proteins.

In order to retrieve a False Discovery Rate (FDR) in your searches, we provide, in addition to the normal database (FORWARD database), a RANDOM database and a DECOY database.

The RANDOM database has been created using a ProteinScape scrip that generate a database randomizing the amino acids for each protein sequence in the FORWARD database.

The DECOY database has been created concatenating de FORWARD database and the DECOY database and is the one that we recommend to use as possible.

     

DECOY (FORWARD+RANDOM) database: ftp://estrellapolar.cnb.csic.es/PME5/PME5_Decoy_3.0.zip *recomended*
Although we encourage you to use the DECOY database, we also provide the following FORWARD and RANDOM databases if they are needed by some tool that you use for the analysis:
FORWARD database: ftp://estrellapolar.cnb.csic.es/PME5/PME5_3.0.zip  
RANDOM database: ftp://estrellapolar.cnb.csic.es/PME5/PME5_RND_3.0.zip  

         
         
      Important: MASCOT provides an option for automatic decoy search. With this option, it is not necessary to create and install a decoy or random database in order to obtain a False Discovery Rate (FDR) estimation. However, in order to minimize the variability between different decoy searching approaches, we recommend installing the DECOY database (FORWARD + RANDOM) in your MASCOT server and then calculate manually the FDR.
       
     

In case of the ProteoRed Phenyx server, we have installed the DECOY database and we have called it: PME5_ForwardDecoy (indicating that is composed by the FORWARD and the RANDOM database).

                   
                   
                   
Instructions for installing databases in your local MASCOT server
       
     
  • As we have stated previously, we recommend installing the DECOY database (FORWARD + RANDOM) in your MASCOT server and then calculate manually the FDR.
     
  • Rule to parse accession string from all three databases: ">..|\([^|]*\)"
     
  • Rule to parse description string from all three databases: ">[^ ]* \(.*\)"
     
  • An exhaustive description about how to install these databases in MASCOT has been written here:
        How to install the PME5 sequence databases in MASCOT How to install the PME5 sequence databases in MASCOT
                   
       
       
Instructions for generating the FDR pept matches report in PHENYX
       
      As you can see in the GeneBio's wiki page here, Phenyx provides a way to calculate the FDR from your results.
      Steps:
     
  • Search your peak-list using the database called PME5_ForwardDecoy
     
Select PME5_ForwardDecoy database in Phenyx

 

     
  • In the main page of phenyx, select your job ID and click on "Exports" option at the left of the screen.
     
  • Select the "false discovery rate pept matches (excel)" option.
     
  • In "extra arguments" wrrte: --acregexpfalse='rnd.+' in order to indicate that the search has been run on a concatenated version of a forward and decoy databank, and that all AC from the decoy databank start by "rnd".
     
FDR export in Phenyx
       
       
                   
                   
Back to Data Analysis section

 


Working Group 1 | Working Group 2 | Working Group 3 | Working Group 4
Working Group 5 | Working Group 6 | Working Group 7

@2009 ProteoRed. All rights reserved. Credits