BMC Caller can be used to assign the type and inventory the contents of Bacterial Microcompartment loci from protein sequences (link to paper describing BMC Caller). For general information on Bacterial Microcompartments (BMCs) see Wikipedia (link) or a recent review (link).
There are two modes for BMC Caller, 1) analysis of BMC protein components (includes the shell, the core enzymes and any other conserved loci-associated proteins) and 2) a BMC shell protein-only mode.
Input: Both modes accept multi-FASTA files as input with a first line marked by ">" followed by the sequence identifier and the following line(s) containing the protein sequence (with no terminator character) and subsequent sequences starting again with a new line and ">" followed by sequence identifier. Also see example link on submission page for a sample input.
The order of the sequences should represent the order of the genes from the genome for optimal BMC type assignment but is not needed to identify the individual components. Generally a range of +/- 12 genes from any shell protein gene contains the whole BMC locus and is optimal for BMC type assignment.
For BMC shell protein analysis:Whole proteomes can be used as input, only shell proteins will be displayed as output. This is a good way to find the best input sequences for BMC locus analysis and might already properly identify all the BMC loci in the proteome/genome.
JGI IMG/M: search for the target genome, download the .faa file and then use the BMC shell protein analysis to find BMC locus sites and/or copy-paste the desired protein sequences for BMC locus analysis.
Uniprot Proteomes: search for genome, then download all entries in uncompressed "FASTA (canonical)" format. BMC locus sites can be identified with the BMC shell protein analysis tool or searching the FASTA file with relevant terms like "microcompartment" or "carboxysome".
NCBI taxonomy: search for the desired organism and follow the links to organism details; click on the Protein/Direct links number in red. Click the "Sent to:" button and choose FASTA format and default order.
Abbreviation   | Description | Comments / Alternate Nomenclature / Updates |
---|---|---|
ACI | Acidobacterium microcompartment | |
ARO | Aromatic substrate microcompartment | |
BUF | Bacterial microcompartment of unknown function (and no AldDh) | BUF1 is also known as Xau / Xanthine utilization (link) |
CsomeACh | Alpha-carboxysome of chemoautotroph bacteria | |
CsomeACy | Alpha-carboxysome of cyanobacteria | |
CsomeBCy | Beta-carboxysome of cyanobacteria | |
ETU | Ethanol utilizing microcompartment | |
EUT | Ethanolamine utilizing microcompartment | |
EUT2x | EUT BMC, missing signature enzyme | |
FRAG | Fragmented type microcompartment | |
GRM | Glycyl radical enzyme containing microcompartment | GRM1/2 process choline, and are also known as Cut BMCs, GRM 3/4/6 process propanediol, GRM5 fucose/rhamnose |
GRM5like | GRM5 BMC with missing signature enzyme | |
GRMguf | GRM BMC with glycyl radical enzyme of unknown function | A function of the glycyl radical enzyme has recently proposed as isethionate C-S bond cleavage (link) |
HO | BMCs related to Haliangium ochraceum model BMC locus | |
MIC | Metabolosome loci where the majority of loci do not encode an AlcDH and/or PTAC (incomplete core) | |
MUF | Metabolosome microcompartment of unknown function | |
PDU | Propanediol utilization microcompartment | |
PVM | Planctomycete and Verrucomicrobia microcompartment | |
PVMlike | Microcompartment with similar enzyme complement as PVM | |
RMM | Rhodococcus and Mycobacteria microcompartment | RMM are also known as AAUM for aminoacetone utilizing microcompartment (link) |
SPU | Sugar phosphate utilization microcompartment |