MEGAN

Metagenomic analysis software


title: "MEGAN" type: doc version: 1 created: 2026-02-28 author: "Wikipedia contributors" status: active scope: public tags: ["metagenomics-software", "phylogenetics-software", "molecular-biology"] description: "Metagenomic analysis software" topic_path: "science/biology" source: "https://en.wikipedia.org/wiki/MEGAN" license: "CC BY-SA 4.0" wikipedia_page_id: 0 wikipedia_revision_id: 0

::summary Metagenomic analysis software ::

::data[format=table title="Infobox software"]

FieldValue
nameMEGAN
developerDaniel Huson et al.
latest release version6.25.10
latest release date2024
repo
programming languageJava
operating systemWindows, Unix, Linux, macOS
platformJava
genreBioinformatics
licenseFree open source "community edition", commercial "Ultimate edition" licensed by Computomics
website
::

| name =MEGAN | developer = Daniel Huson et al. | latest release version = 6.25.10 | latest release date = 2024 | repo = | programming language = Java | operating system = Windows, Unix, Linux, macOS | platform = Java | genre = Bioinformatics | license = Free open source "community edition", commercial "Ultimate edition" licensed by Computomics | website =

MEGAN (MEtaGenome ANalyzer) is a computer program that allows optimized analysis of large metagenomic datasets.

Metagenomics is the analysis of the genomic sequences from a usually uncultured environmental sample. One of its long-term goals is to inventory and measure the extent and role of microbial biodiversity in the ecosystem, based on discoveries that the diversity of microbial organisms and viral agents in the environment is far greater than previously estimated. MEGAN is an example of a tool that allows the investigation of very large datasets from environmental samples (using shotgun sequencing techniques in particular). It is designed to sample and investigate the unknown biodiversity of environmental samples where more precise techniques with smaller, better known samples, cannot be used.

Fragments of DNA from a metagenomics sample, such as ocean water or soil, are compared against databases of known DNA sequences using BLAST or another sequence comparison tool to assemble the segments into discrete, comparable sequences. MEGAN is then used to compare the resulting sequences with gene sequences from GenBank in NCBI. This program was used to investigate the DNA of a woolly mammoth recovered from the Siberian permafrost and the Sargasso Sea dataset.{{ Cite journal | title = Environmental Genome Shotgun Sequencing of the Sargasso Sea | journal = Science |vauthors=Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, Wu D, Paulsen I, Nelson KE, Nelson W, Fouts DE, Levy S, Knap AH, Lomas MW, Nealson K, White O, Peterson J, Hoffman J, Parsons R, Baden-Tillson H, Pfannkoch C, Rogers YH, Smith HO | doi=10.1126/science.1093857 | pmid=15001713 | bibcode = 2004Sci...304...66V | volume=304 | issue=5667 |date=April 2004 | pages=66–74| citeseerx = 10.1.1.124.1840 | s2cid = 1454587 }}

Introduction

Metagenomics is the study of genomic content of samples from the same habitat, aimed at determining the role and extent of species diversity. Both targeted and random sequencing are commonly used, with comparisons made against sequence databases. Recent developments in sequencing technology have led to an increase in the number of metagenomics samples. MEGAN is a tool for analyzing metagenomics data. The first version of MEGAN was released in 2007,{{cite journal | last = Huson | first = H. |author2=A. Auch |author3=Ji Qi |author4=S. C. Schuster | title = MEGAN Analysis of Metagenomic Data | journal=Genome Research | volume = 17 | issue = 3| pages = 377–386 | year = 2007 | url = http://www.genome.org/cgi/reprint/gr.5969107v1 | doi = 10.1101/gr.5969107| access-date = April 3, 2008 | pmid = 17255551 | pmc = 1800929 }} and the most recent version is MEGAN6. |last1=Huson |first1=Daniel H |last2=S. Beier |last3=I. Flade |last4=A. Gorska |last5=M. El-Hadidi |last6=H. Ruscheweyh |last7=R. Tappu |title=MEGAN Community Edition - Interactive exploration and analysis of large-scale microbiome sequencing data |journal=PLOS Computational Biology |volume=12 |issue=6 |article-number=e1004957 |doi=10.1371/journal.pcbi.1004957 |pmid=27327495 |pmc=4915700 |bibcode=2016PLSCB..12E4957H |year=2016 |doi-access=free While the initial version could analyze the taxonomic content of a single dataset, later versions can handle multiple datasets and include new features such as querying different databases and employing updated algorithms.

MEGAN Pipeline

MEGAN analysis starts with collecting reads from any shotgun platform. Then, the reads are compared with sequence databases using BLAST or similar tools. After that, MEGAN assigns a taxon ID to processed read results based on NCBI taxonomy, creating a MEGAN file that contains the necessary information for statistical and graphical analysis. Lastly, the lowest common ancestor (LCA) algorithm can be run to inspect assignments, analyze data, and create summaries based on different NCBI taxonomy levels. The LCA algorithm identifies the lowest common ancestor among different species.

Release history

::data[format=table]

VersionRelease dateMajor highlights
1.025 January 2007Initial standalone interactive tool for single metagenome analysis
2.0Multiple dataset support, read extraction by taxon, COGs analysis, accession numbers for read identification, basic charting capabilities
3.0New RMA file format
4.020 June 2011Introduced integrative, comparative, functional analyses
5.0April 2013COG/EGGNOG analysis, new SEED and KEGG mapping files, PCoA analysis of taxonomy and function, improved LCA algorithm, new charts, biome extraction methods
6.021 June 2016New RMA6 file format; supports NCBI/SILVA taxonomy, InterPro2GO, SEED, eggNOG, and KEGG; gene-centric read assembly; enhanced GUI and metadata tools
7.02 February 2024last1=Huson
::

References

References

  1. (2004). "More than meets the eye". Nature.
  2. Frias-Lopez, Jorge. (March 11, 2008). "Microbial community gene expression in ocean surface waters". PNAS.
  3. Poinar, Hendrik N.. (2007). "Metagenomics to Paleogenomics: Large-Scale Sequencing of Mammoth DNA". Science.
  4. (30 March 2010). "User Manual for MEGAN V3.9".
  5. (21 November 2014). "MEGAN5 - MEtaGenome ANalyzer".
  6. (26 June 2025). "MEGAN 7: Metagenome Analyzer User Manual".

::callout[type=info title="Wikipedia Source"] This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page. ::

metagenomics-softwarephylogenetics-softwaremolecular-biology