- E. Datema (1)
- M.W.E.J. Fiers (2)
- J.C.W. Groot de (4)
- R.C.H.J. Ham van (2)
- E.A. Herniou (1)
- S.I.A. Moco (1)
- M.M. Oers van (1)
- S. Peters (1)
- H.A. Verhoeven (1)
- J.J.M. Vervoort (1)
- J.M. Vlak (1)
- O.F.J. Vorst (1)
- C.H. Vos de (1)
High-throughput bioinformatics with the Cyrille2 pipeline system.
Fiers, M.W.E.J. ; Burgt, A. van der; Datema, E. ; Groot, J.C.W. de; Ham, R.C.H.J. van - \ 2008
BMC Bioinformatics 9 (2008). - ISSN 1471-2105 - 10 p.
workflows - services - tool
Background - Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. The analyses are often interdependent and chained together to form complex workflows or pipelines. Given the volume of the data used and the multitude of computational resources available, specialized pipeline software is required to make high-throughput analysis of large-scale omics datasets feasible. Results - We have developed a generic pipeline system called Cyrille2. The system is modular in design and consists of three functionally distinct parts: 1) a web based, graphical user interface (GUI) that enables a pipeline operator to manage the system; 2) the Scheduler, which forms the functional core of the system and which tracks what data enters the system and determines what jobs must be scheduled for execution, and; 3) the Executor, which searches for scheduled jobs and executes these on a compute cluster. Conclusion - The Cyrille2 system is an extensible, modular system, implementing the stated requirements. Cyrille2 enables easy creation and execution of high throughput, flexible bioinformatics pipelines.
Post alignment clustering procedure for comparative quantitative proteomics LC-MS Data
Groot, J.C.W. de; Fiers, M.W.E.J. ; Ham, R.C.H.J. van; America, A.H.P. - \ 2008
Proteomics 8 (2008)1. - ISSN 1615-9853 - p. 32 - 36.
identification - proteins
Comparative LC-MS is a powerful method for detailed quantitative comparison of complex protein mixtures. Dedicated software is required for detection, matching, and alignment of peaks in multiple LC-MS datasets. However, retention time shifts, saturation effects, limitations of experimental accuracy, and possible occurrence of split peaks make it difficult for software to perfectly match all chromatograms. We describe a procedure to assess the above problems and show that dataset quality can be enhanced with the aid of cluster analysis.
A liquid chromatography-mass spectrometry-based metabolome database for tomato
Moco, S.I.A. ; Bino, R.J. ; Vorst, O.F.J. ; Verhoeven, H.A. ; Groot, J.C.W. de; Beek, T.A. van; Vervoort, J.J.M. ; Vos, C.H. de - \ 2006
Plant Physiology 141 (2006)4. - ISSN 0032-0889 - p. 1205 - 1218.
steroidal alkaloid glycosides - solid-phase extraction - lycopersicon-esculentum - hydroxycinnamic acids - alpha-tomatine - quantitative-analysis - antioxidant activity - phenolic-compounds - fruit - vegetables
For the description of the metabolome of an organism, the development of common metabolite databases is of utmost importance. Here we present the Metabolome Tomato Database (MoTo DB), a metabolite database dedicated to liquid chromatography-mass spectrometry (LC-MS)- based metabolomics of tomato fruit (Solanum lycopersicum). A reproducible analytical approach consisting of reversed-phase LC coupled to quadrupole time-of-flight MS and photodiode array detection (PDA) was developed for large-scale detection and identification of mainly semipolar metabolites in plants and for the incorporation of the tomato fruit metabolite data into the MoTo DB. Chromatograms were processed using software tools for mass signal extraction and alignment, and intensity-dependent accurate mass calculation. The detected masses were assigned by matching their accurate mass signals with tomato compounds reported in literature and complemented, as much as possible, by PDA and MS/MS information, as well as by using reference compounds. Several novel compounds not previously reported for tomato fruit were identified in this manner and added to the database. The MoTo DB is available at http://appliedbioinformatics.wur.nl and contains all information so far assembled using this LC-PDA-quadrupole time-of-flight MS platform, including retention times, calculated accurate masses, PDA spectra, MS/MS fragments, and literature references. Unbiased metabolic profiling and comparison of peel and flesh tissues from tomato fruits validated the applicability of the MoTo DB, revealing that all flavonoids and -tomatine were specifically present in the peel, while several other alkaloids and some particular phenylpropanoids were mainly present in the flesh tissue
Genome sequence of Chrysodeixis chalcites nucleopolyhedrovirus, a baculovirus with two DNA photolyase genes
Oers, M.M. van; Abma-Henkens, M.H.C. ; Herniou, E.A. ; Groot, J.C.W. de; Peters, S. ; Vlak, J.M. - \ 2005
Journal of General Virology 86 (2005)7. - ISSN 0022-1317 - p. 2069 - 2080.
nuclear polyhedrosis-virus - granulovirus genome - mamestra-configurata - maximum-likelihood - organization - protein - entomopoxvirus - identification - search - rates
The complete genome sequence of a single nucleocapsid nucleopolyhedrovirus recently isolated from Chrysodeixis chalcites (ChchNPV) was determined. The viral genome has a size of 149 622 bp and an overall G+C content of 39·1 mol%. The sequence contains 151 predicted open reading frames (ORFs) with a minimal size of 50 codons. The similarity of these ORFs with those of other completely sequenced baculoviruses was calculated using a newly developed database, named GECCO. Phylogenetic analysis of the whole genome confirmed the evolutionary relationship of ChchNPV with group II NPVs, as did the absence of the NPV group I-specific gp64 gene. It is the first group II NPV to encode proliferating cell nuclear antigen. Most noteworthy is the presence of two ORFs encoding a class II cyclobutane pyrimidine dimer DNA photolyase. These two ORFs share only 45 % amino acid identity and have different promoter motifs. Twenty-two additional unique baculovirus genes were identified, including a gene encoding a novel putative RING finger protein with a possible homologue in poxviruses