Skip to content
Ignacio edited this page Mar 30, 2014 · 28 revisions

Welcome to the OpenCGA wiki!

OpenCGA aims to provide a Big Data storage engine and analysis framework for genomic scale data analysis of hundreds of TBs or even PBs. For this, we are exploring and using the most modern advanced technologies from different fields:

  • Big data: Apache Hadoop for big data processing and storage
  • NoSQL
  • High-performance Computing (HPC)
  • HTML5 and RESTful web services

Its features will include uploading and downloading files to a repository, storing their information in a generic way (non-dependant of the original file-format) and retrieving this information efficiently.

Goals and Motivation

Technical Documentation

At this section you can find some useful links and information for researchers and software developers who are planning to deploy and/or integrating OpenCGA services with their software applications and tools. These are working documents:

  • Data models: For a description of data models for representing Variant and alignment data visit...
  • Architecture: For a description of the technologies and architecture of OpenCGA and some other implementation details visit the [architecture] (https://github.com/opencb/opencga/wiki/Architecture) section.
  • Storage implementation:
  • Download and install
  • Releases and Roadmap: Do you want to know what's coming next? Please visit our

##Getting Involved Examples: http://www.chromium.org/getting-involved

http://tomcat.apache.org/getinvolved.html

Support

Support

About

Clone this wiki locally