Bimber Lab Software
While we are primarily an infectious disease lab, creating software to analyze and manage data is a substantial component of our work. Where possible, we package this work into stable, broadly useful products and make them available to the wider community. Below are the primary categories of software we develop and support.
DISCVR-seq Toolkit
DISCVR-seq Toolkit is a suite of tools built on the GATK4 engine. Many of our sequence and variant analysis tools are part of this toolkit, including VariantQC, which creates HTML summary reports from VCF files.
LabKey Modules
We heavily use LabKey Server for a range of tasks, including basic lab functions (data and sample management), as well as management and analysis of most of our sequence data. Our modules fall into several categories:
- Laboratory Module: This is the base module for our lab data management and the core of our operations. It can be used for basic data and experiment management (roughly like an electronic lab notebook). It is in use beyond our group.
- SequenceAnalysis: This module is the core of our sequence data management and analysis. It allows central management of raw data, analysis products, and resources (genomes, gene annotations, etc.). Related modules allow integration with BLAST and with JBrowse, the open source web-based genome browser. Our cluster module enables submission of LabKey jobs to either Slurm or HTCondor clusters.
- Other notable modules:
  - OpenLdapSync: This is a standalone module that allows a LabKey Server instance to automatically sync users and groups with an LDAP server. LabKey offers a premium (not free or open source) feature to sync groups, which was created in 2018 by cloning our code.
R Packages
- cellhashR is an R package designed to work with cell hashing and CITE-seq data. It can be used for QC of the raw data, as well as generation of cell hashing calls.
- geneSetViz is an R package / Shiny application designed to help interpret sets of genes, such as those obtained from RNA-seq experiments. It allows the user to query a given set of genes against many reference sets and pathway tools from a single portal.
External Contributions
Lab members have contributed to the following projects: