CloudBioLinux offers genome analysis resources for cloud computing platforms such as Amazon EC2. We develop freely available, community maintained software images and data repositories for biological analysis.

Motivation

Many bioinformatics workflows involve large datasets in which high performance computing is needed. Cloud computing provides researchers with the ability to perform computations using a practically unlimited pool of virtual machines, using platforms such as Amazon EC2, Eucalyptus or VirtualBox. CloudBioLinux utilizes these resources to enable instant access to biological software, programming libraries and data.

CloudBioLinux is a community project and we welcome contributors and feedback. Software and data are built using Fabric for fully automated installation and deployment. Packages are specified in simple configuration files for both Linux packages and programming language libraries. Please fork our code on GitHub and suggest improvements and additions.

These resources are designed for biologists as well as programmers. With the help of the NEBC Bio-Linux development team, images include the biological software and libraries available in local installations along with a FreeNX desktop environment designed to ease the transition to remote computational analysis.

Resources

Documentation