CPL - Chalmers Publication Library
| Utbildning | Forskning | Styrkeområden | Om Chalmers | In English In English Ej inloggad.

HirBin: High-resolution identification of differentially abundant functions in metagenomes

Tobias Österlund (Institutionen för matematiska vetenskaper, Tillämpad matematik och statistik) ; Viktor Jonsson (Institutionen för matematiska vetenskaper, Tillämpad matematik och statistik) ; Erik Kristiansson (Institutionen för matematiska vetenskaper, Tillämpad matematik och statistik)
BMC Genomics (1471-2164). Vol. 18 (2017), 1,
[Artikel, refereegranskad vetenskaplig]

Background: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification of the sequence reads (binning). However, biological effects acting on specific functions may be overlooked if the classes are too general. Methods: Here we introduce High-Resolution Binning (HirBin), a new method for gene-centric analysis of metagenomes. HirBin combines supervised annotation with unsupervised clustering to bin sequence reads at a higher resolution. The supervised annotation is performed by matching sequence fragments to genes using well-established protein domains, such as TIGRFAM, PFAM or COGs, followed by unsupervised clustering where each functional domain is further divided into sub-bins based on sequence similarity. Finally, differential abundance of the sub-bins is statistically assessed. Results:We show that HirBin is able to identify biological effects that are only present at more specific functional levels. Furthermore we show that changes affecting more specific functional levels are often diluted at the more general level and therefore overlooked when analyzed using standard binning approaches. Conclusions: HirBin improves the resolution of the gene-centric analysis of metagenomes and facilitates the biological interpretation of the results. HirBin is implemented as a Python package and is freely available for download at http://bioinformatics.math.chalmers.se/hirbin.

Nyckelord: Binning; Differential abundance; Functional annotation; Metagenomics; Next-generation sequencing; Statistical analysis; TIGRFAM



Denna post skapades 2017-06-14. Senast ändrad 2017-06-20.
CPL Pubid: 249824

 

Läs direkt!

Lokal fulltext (fritt tillgänglig)

Länk till annan sajt (kan kräva inloggning)


Institutioner (Chalmers)

Institutionen för matematiska vetenskaper, Tillämpad matematik och statistikInstitutionen för matematiska vetenskaper, Tillämpad matematik och statistik (GU)

Ämnesområden

Bioinformatik (beräkningsbiologi)
Genetik
Bioinformatik och systembiologi

Chalmers infrastruktur