Skip to content

Welcome to ReDU (Reanalysis of Data User Interface)

"What is ReDU?"

Finding and reusing data is hard. ReDU aims to make that easier. ReDU is a community-minded approach to find and reuse public data containing tandem MS data at the repository scale. ReDU is the bridge between the Global Natural Product Social Molecular Networking Analysis Platform (GNPS) and MassIVE, a public data repository for mass spectrometry data. The key to making this work is a systemic way of capturing basic information about each file contributed to ReDU (i.e. sample information aka metadata). Using the sample information, users can select public data for re-analysis in GNPS or provide their own data along with public data to perform co-analysis. Our aim is to empower researchers to put their data in the context of public data as well as explore questions using public data at the repository scale.

image

How do I contribute to ReDU?

This is a community effort and everyone is encouraged to participate by submitting their own data and sample information instructions. The sharing of new applications (and code) which use ReDU is highly encouraged.


Components of ReDU

  1. ReDU Sample Information
    • information capture system including a template - link here - as well as a validator - link here.
    • a controlled vocabulary and ontologies (e.g. Disease Ontology, Uber Ontology, Mass Spectrometry Ontology) are used.
  2. ReDU File Selector
    • the ReDU File Selector is the tool that enables users to find files based on ReDU sample information (i.e. metadata) and assemble files into cohorts for analysis in GNPS.
    • One can select only public data files in ReDU to perform "re-analysis" or one can select public data files in ReDU and add their own data file to perform "co-analysis".
  3. GNPS (gnps.ucsd.edu)
    • GNPS is an analysis platform supporting ReDU.
    • Chemical annotations are performed via GNPS spectral library search (i.e. MS/MS spectral library matching)
    • Molecular networking (co- and re-analyses) are performed in GNPS.
  4. MassIVE (massive.ucsd.edu)
    • MassIVE is the data repository that ReDU utlizes for mass spectrometry data as well as the sample information (i.e. metadata).
  5. Emperor
    • Emperor is a visualization tool used to display the results of principal component analysis (PCA) performed on the chemicals annotated in each file in ReDU.
    • Users can explore all files in ReDU or project their own data onto the PCA space using a GNPS task ID.

We are developing two main branches of functionality built to re-analyze public data in ReDU or co-analysis of user data with public data in ReDU

  • Repository-scale Molecular Networking via GNPS - link here
  • Chemical Explorer in ReDU - link here
  • Multivariate Analysis (PCA) visualized via Emperor - link here
  • Group Comparator in ReDU - link here
  • Annotation by Library Search via GNPS - link here

ReDU Schematic

clicking on the image below launches an interactive version or this link - link here

Mermaid Live


Data and Code Availability

  • All data can be downloaded from MassIVE
  • All sample information can be downloaded from the ReDU-MS2 homepage by clicking "Download Database".
  • The ReDU identification database is publicly available and accessible via (GNPS), MSV000084206.
  • Code for ReDU can be found on GitHub - link here

Recommended Citation

Issues and Suggestions

Page Contributions

Alan K. Jarmusch (UCSD), Mingxun Wang (UCSD), Christine M. Aceves (UCSD)


Last update: May 20, 2020 07:34:09