<tools xmlns="biotoolsSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="biotoolsSchema file:///E:/repos/GitHub/biotoolsShim/genericxml2xml/versions/biotools-3.3.0/biotools_3.3.0.xsd"><tool><name>pathotypr</name><description>A fast, offline, pathogen-agnostic toolkit for lineage classification and marker-driven genotyping from whole-genome sequencing data. Bring your own SNP marker panel for any microbial pathogen &#8212; pathotypr handles the rest. Ships with curated panels for Mycobacterium tuberculosis complex (MTBC): 3,707 lineage-defining SNPs and over 102,000 resistance-associated mutations. Works with assembled genomes (FASTA) and raw reads (FASTQ). Available as command-line interface and cross-platform desktop GUI (Tauri).</description><homepage>https://github.com/PathoGenOmics-Lab/pathotypr</homepage><biotoolsID>pathotypr</biotoolsID><biotoolsCURIE>biotools:pathotypr</biotoolsCURIE><version>1.0.0</version><toolType>Command-line tool</toolType><toolType>Desktop application</toolType><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><topic><uri>http://edamontology.org/topic_3301</uri><term>Microbiology</term></topic><topic><uri>http://edamontology.org/topic_0625</uri><term>Genotype and phenotype</term></topic><topic><uri>http://edamontology.org/topic_3168</uri><term>Sequencing</term></topic><topic><uri>http://edamontology.org/topic_3305</uri><term>Public health and epidemiology</term></topic><topic><uri>http://edamontology.org/topic_3474</uri><term>Machine learning</term></topic><language>Rust</language><license>AGPL-3.0</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3196</uri><term>Genotyping</term></operation><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0920</uri><term>Genotype/phenotype report</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><note>train &#8212; Build a Random Forest classifier from labeled genomes.</note></function><function><operation><uri>http://edamontology.org/operation_3225</uri><term>Variant classification</term></operation><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0920</uri><term>Genotype/phenotype report</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><note>predict &#8212; Assign lineages to genomes using a pre-trained Random Forest model.</note></function><function><operation><uri>http://edamontology.org/operation_3196</uri><term>Genotyping</term></operation><operation><uri>http://edamontology.org/operation_3225</uri><term>Variant classification</term></operation><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0920</uri><term>Genotype/phenotype report</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><note>classify &#8212; Call known SNP markers in assembled genomes against a user-defined marker panel.</note></function><function><operation><uri>http://edamontology.org/operation_3196</uri><term>Genotyping</term></operation><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_0920</uri><term>Genotype/phenotype report</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><note>split-fastq &#8212; Alignment-free genotyping directly from raw reads using k-mer matching.</note></function><function><operation><uri>http://edamontology.org/operation_0346</uri><term>Sequence similarity search</term></operation><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_0920</uri><term>Genotype/phenotype report</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><note>match &#8212; Find the closest reference genome from a set of references for raw reads.</note></function><download><url>https://github.com/PathoGenOmics-Lab/pathotypr/releases</url><type>Binaries</type></download><download><url>https://github.com/PathoGenOmics-Lab/pathotypr</url><type>Source code</type></download><documentation><url>https://github.com/PathoGenOmics-Lab/pathotypr/tree/main/docs</url><type>User manual</type></documentation><publication><doi>10.64898/2026.03.24.714002</doi><type>Primary</type></publication><credit><name>Paula Ruiz-Rodriguez</name><orcidid>https://orcid.org/0000-0003-0727-5974</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole><typeRole>Maintainer</typeRole></credit><credit><name>Mireia Coscolla</name><orcidid>https://orcid.org/0000-0003-0752-0538</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit><credit><name>I2SysBio (CSIC - Universitat de Val&#232;ncia)</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>PathoGenOmics Lab</name><typeEntity>Division</typeEntity><typeRole>Provider</typeRole></credit></tool><tool><name>MHCcluster</name><description>Functional cluster of MHC class I molecules (MHCI) based on their predicted binding specificity.</description><homepage>http://cbs.dtu.dk/services/MHCcluster/</homepage><biotoolsID>mhccluster</biotoolsID><biotoolsCURIE>biotools:mhccluster</biotoolsCURIE><version>2.0</version><toolType>Web application</toolType><topic><uri>http://edamontology.org/topic_2830</uri><term>Immunoproteins, genes and antigens</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><license>Other</license><maturity>Emerging</maturity><cost>Free of charge (with restrictions)</cost><function><operation><uri>http://edamontology.org/operation_3459</uri><term>Functional clustering</term></operation><input><data><uri>http://edamontology.org/data_1353</uri><term>Sequence motif</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0872</uri><term>Phylogenetic tree</term></data><format><uri>http://edamontology.org/format_2006</uri><term>Phylogenetic tree format</term></format></output><output><data><uri>http://edamontology.org/data_1636</uri><term>Heat map</term></data><format><uri>http://edamontology.org/format_2333</uri><term>Binary format</term></format></output><note>provides heat-map and graphical tree-based visualizations of the functional relationship between MHC class I and class II variants</note></function><link><url>http://cbs.dtu.dk/services</url><type>Software catalogue</type></link><documentation><url>http://www.cbs.dtu.dk/services/MHCcluster-2.0/instructions.php</url><type>General</type></documentation><publication><doi>10.1007/s00251-013-0714-9</doi><pmid>23775223</pmid><pmcid>PMC3750724</pmcid><type>Primary</type></publication><credit><name>CBS</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>Morten Nielsen</name><email>mniel@cbs.dtu.dk</email><orcidid>http://orcid.org/0000-0001-7885-4311</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>HTSlib</name><description>The main purpose of HTSlib is to provide access to genomic information files, both alignment data (SAM, BAM, and CRAM formats) and variant data (VCF and BCF formats). The library also provides interfaces to access and index genome reference data in FASTA format and tab-delimited files with genomic coordinates. It is utilized and incorporated into both SAMtools and BCFtools.</description><homepage>http://www.htslib.org/</homepage><biotoolsID>htslib</biotoolsID><biotoolsCURIE>biotools:htslib</biotoolsCURIE><version>1.0</version><version>1.1</version><version>1.2</version><version>1.2.1</version><version>1.3</version><version>1.3.1</version><version>1.3.2</version><version>1.4</version><version>1.4.1</version><version>1.5</version><version>1.6</version><version>1.7</version><version>1.8</version><version>1.9</version><version>1.10</version><version>1.10.1</version><version>1.10.2</version><version>1.11</version><version>1.12</version><version>1.13</version><version>1.14</version><version>1.15</version><version>1.15.1</version><version>1.16</version><version>1.17</version><version>1.18</version><version>1.19</version><version>1.20</version><version>1.21.1</version><version>1.22</version><version>1.22.1</version><version>1.22.2</version><version>1.23</version><version>1.23.1</version><toolType>Library</toolType><topic><uri>http://edamontology.org/topic_3071</uri><term>Data management</term></topic><operatingSystem>Windows</operatingSystem><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><language>C</language><license>MIT</license><collectionID>Animal and Crop Genomics</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_2409</uri><term>Data handling</term></operation><input><data><uri>http://edamontology.org/data_0924</uri><term>Sequence trace</term></data><format><uri>http://edamontology.org/format_3462</uri><term>CRAM</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_2573</uri><term>SAM</term></format><format><uri>http://edamontology.org/format_2572</uri><term>BAM</term></format><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_3498</uri><term>Sequence variations</term></data><format><uri>http://edamontology.org/format_3020</uri><term>BCF</term></format><format><uri>http://edamontology.org/format_3016</uri><term>VCF</term></format></input><output><data><uri>http://edamontology.org/data_0924</uri><term>Sequence trace</term></data><format><uri>http://edamontology.org/format_3462</uri><term>CRAM</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_2573</uri><term>SAM</term></format><format><uri>http://edamontology.org/format_2572</uri><term>BAM</term></format><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></output><output><data><uri>http://edamontology.org/data_3498</uri><term>Sequence variations</term></data><format><uri>http://edamontology.org/format_3020</uri><term>BCF</term></format><format><uri>http://edamontology.org/format_3016</uri><term>VCF</term></format></output></function><link><url>https://github.com/samtools/htslib</url><type>Repository</type></link><link><url>http://www.htslib.org/support/#lists</url><type>Mailing list</type></link><link><url>https://github.com/samtools/htslib/issues</url><type>Issue tracker</type></link><download><url>http://www.htslib.org/download/</url><type>Downloads page</type></download><documentation><url>http://www.htslib.org/doc/#manual-pages</url><type>User manual</type></documentation><relation><biotoolsID>samtools</biotoolsID><type>usedBy</type></relation><relation><biotoolsID>bcftools</biotoolsID><type>usedBy</type></relation><publication><doi>10.1093/gigascience/giab007</doi><pmid>33594436</pmid><pmcid>PMC7931820</pmcid><type>Primary</type><note>HTSlib: C library for reading/writing high-throughput sequencing data.</note></publication><credit><name>Wellcome Sanger Institute</name><email>samtools@sanger.ac.uk</email><url>https://www.sanger.ac.uk/</url><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole><typeRole>Primary contact</typeRole></credit><credit><name>Samtools Help mailing list</name><url>https://lists.sourceforge.net/lists/listinfo/samtools-help</url><typeEntity>Project</typeEntity><typeRole>Support</typeRole></credit></tool><tool><name>QUAST</name><description>QUAST stands for QUality ASsessment Tool.  
It evaluates a quality of genome assemblies by computing various metrics and providing nice reports.</description><homepage>http://quast.sourceforge.net/quast</homepage><biotoolsID>quast</biotoolsID><biotoolsCURIE>biotools:quast</biotoolsCURIE><version>v.5.3.0</version><toolType>Workflow</toolType><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><language>Perl</language><language>Python</language><language>C</language><license>GPL-2.0</license><collectionID>ONTeater</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0337</uri><term>Visualisation</term></operation><operation><uri>http://edamontology.org/operation_3180</uri><term>Sequence assembly validation</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><note># Running quast on a eukaryotic genome</note><cmd>quast -ek assembly.fa --out output_prefix</cmd></function><link><url>https://github.com/ablab/quast</url><type>Repository</type></link><link><url>https://github.com/ablab/quast/issues</url><type>Issue tracker</type></link><documentation><url>http://quast.bioinf.spbau.ru/</url><type>General</type></documentation><publication><doi>10.1093/bioinformatics/btt086</doi><pmid>23422339</pmid><pmcid>PMC3624806</pmcid></publication><credit><name>QUAST Support</name><email>quast.support@cab.spbu.ru</email><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>WEBnma</name><description>WEBnm@ provides quick, automated computation and analysis of low-frequency normal modes for protein structures.</description><homepage>http://apps.cbu.uib.no/webnma</homepage><biotoolsID>webnma</biotoolsID><biotoolsCURIE>biotools:webnma</biotoolsCURIE><version>3.5</version><toolType>Web API</toolType><toolType>Suite</toolType><topic><uri>http://edamontology.org/topic_2814</uri><term>Protein structure analysis</term></topic><topic><uri>http://edamontology.org/topic_0736</uri><term>Protein folds and structural domains</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><language>Python</language><license>GPL-3.0</license><collectionID>BiB tools</collectionID><collectionID>CBU tools</collectionID><collectionID>UiB tools</collectionID><collectionID>ELIXIR-NO</collectionID><collectionID>ELIXIR-Norway</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirCommunity>3D-BioInfo</elixirCommunity><elixirNode>Norway</elixirNode><function><operation><uri>http://edamontology.org/operation_0570</uri><term>Structure visualisation</term></operation><operation><uri>http://edamontology.org/operation_0244</uri><term>Protein flexibility and motion analysis</term></operation><input><data><uri>http://edamontology.org/data_1460</uri><term>Protein structure</term></data><format><uri>http://edamontology.org/format_1476</uri><term>PDB</term></format></input><output><data><uri>http://edamontology.org/data_2884</uri><term>Plot</term></data><format><uri>http://edamontology.org/format_3508</uri><term>PDF</term></format></output><output><data><uri>http://edamontology.org/data_1354</uri><term>Sequence profile</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output><note>Constructs elastic network model from alpha carbon coordinates of the protein, and computes properties to describe large scale conformations. Computes normal modes, fluctuation profiles, inter-residue correlations, conformational overlap analysis and vector field representations. Structural amino acid profiles, and normal mode characteristics describing protein motion, visualized in plots and decorated structure visualizations. White space delimited tabular data for normal modes and the provided plots</note></function><function><operation><uri>http://edamontology.org/operation_2487</uri><term>Protein structure comparison</term></operation><operation><uri>http://edamontology.org/operation_0337</uri><term>Visualisation</term></operation><input><data><uri>http://edamontology.org/data_0886</uri><term>Structure alignment</term></data><format><uri>http://edamontology.org/format_2200</uri><term>FASTA-like (text)</term></format></input><input><data><uri>http://edamontology.org/data_1460</uri><term>Protein structure</term></data><format><uri>http://edamontology.org/format_1476</uri><term>PDB</term></format></input><output><data><uri>http://edamontology.org/data_2884</uri><term>Plot</term></data><format><uri>http://edamontology.org/format_3508</uri><term>PDF</term></format></output><output><data><uri>http://edamontology.org/data_0889</uri><term>Structural profile</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output><note>Performs comparative analysis of the normal modes of protein structures. Computes the Bhattacharyya Coefficient (BC) and the Root Mean Squared Inner Product (RMSIP) of aligned parts of the proteins. Alignment of sets of proteins to be compared. Multiple protein structures Heatmaps, dendrograms and structural amino acid profiles for visual comparison of structural similarity. White space delimited tabular data for the provided plots</note></function><link><url>https://github.com/reuter-group/webnma3</url><type>Repository</type><type>Issue tracker</type></link><link><url>https://elixir.no/helpdesk</url><type>Helpdesk</type><note>Helpdesk and support for ELIXIR Norway services.</note></link><documentation><url>http://apps.cbu.uib.no/webnma3/howto/single</url><type>Quick start guide</type></documentation><documentation><url>http://apps.cbu.uib.no/webnma3/qanda</url><type>FAQ</type></documentation><documentation><url>http://apps.cbu.uib.no/webnma3/about</url><type>General</type><type>Citation instructions</type></documentation><relation><biotoolsID>numpy</biotoolsID><type>uses</type></relation><relation><biotoolsID>biopython</biotoolsID><type>uses</type></relation><relation><biotoolsID>matplotlib</biotoolsID><type>uses</type></relation><relation><biotoolsID>mustang</biotoolsID><type>uses</type></relation><relation><biotoolsID>dssp</biotoolsID><type>uses</type></relation><publication><doi>10.1186/s12859-014-0427-6</doi><pmid>25547242</pmid><pmcid>PMC4339738</pmcid><type>Primary</type><version>2.0</version></publication><publication><doi>10.1186/1471-2105-6-52</doi><pmid>15762993</pmid><pmcid>PMC1274249</pmcid><type>Primary</type><version>1.0</version></publication><credit><name>Nathalie Reuter</name><email>Nathalie.Reuter@uib.no</email><url>http://www.cbu.uib.no/reuter/</url><orcidid>https://orcid.org/0000-0002-3649-7675</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Developer</typeRole></credit><credit><name>Sandhya P Tiwari</name><orcidid>https://orcid.org/0000-0002-0747-3826</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Kidane M Tekle</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Tristan Cragnolini</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Svenn H Grindhaug</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Lars Skj&#230;rven</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Gisle Salensminde</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Edvin Fuglebakk</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Siv M Hollup</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Department of Molecular Biology, University of Bergen, Norway</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>Computational Biology Unit, Department of Informatics, University of Bergen, Norway</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>UiB</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit></tool><tool><name>Mapler</name><description>Mapler is a pipeline for assessing assembly quality in taxonomically rich metagenomes sequenced with HiFi reads. It incorporates state-of-the-art metrics and facilitates the comparison of assembly strategies.</description><homepage>https://github.com/Nimauric/Mapler</homepage><biotoolsID>mapler</biotoolsID><biotoolsCURIE>biotools:mapler</biotoolsCURIE><toolType>Workflow</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><language>Bash</language><language>Python</language><license>AGPL-3.0</license><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0310</uri><term>Sequence assembly</term></operation><input><data><uri>http://edamontology.org/data_0924</uri><term>Sequence trace</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function><function><operation><uri>http://edamontology.org/operation_3180</uri><term>Sequence assembly validation</term></operation><operation><uri>http://edamontology.org/operation_3731</uri><term>Sample comparison</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3914</uri><term>Quality control report</term></data><format><uri>http://edamontology.org/format_3751</uri><term>DSV</term></format><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output></function><function><operation><uri>http://edamontology.org/operation_3798</uri><term>Read binning</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function><link><url>https://github.com/Nimauric/Mapler</url><type>Repository</type></link><publication><doi>10.1093/BIOINFORMATICS/BTAF334</doi><pmid>40478660</pmid><pmcid>PMC12205171</pmcid></publication><credit><name>Nicolas Maurice</name><email>nicolas.maurice@inria.fr</email><orcidid>https://orcid.org/0009-0009-9615-2765</orcidid><typeEntity>Person</typeEntity></credit><credit><name>Claire Lemaitre</name><orcidid>https://orcid.org/0000-0001-8675-170X</orcidid></credit><credit><name>Riccardo Vicedomini</name><orcidid>https://orcid.org/0000-0002-7706-0998</orcidid></credit><credit><name>Cl&#233;mence Frioux</name><email>clemence.frioux@inria.fr</email><orcidid>https://orcid.org/0000-0003-2114-0697</orcidid><typeEntity>Person</typeEntity></credit></tool><tool><name>NetPhos</name><description>Neural network predictions for serine, threonine and tyrosine phosphorylation sites in eukaryotic proteins.</description><homepage>http://cbs.dtu.dk/services/NetPhos/</homepage><biotoolsID>netphos</biotoolsID><biotoolsCURIE>biotools:netphos</biotoolsCURIE><version>2.0</version><toolType>Command-line tool</toolType><toolType>Web application</toolType><toolType>Web service</toolType><topic><uri>http://edamontology.org/topic_0160</uri><term>Sequence sites, features and motifs</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><license>Other</license><maturity>Emerging</maturity><cost>Free of charge (with restrictions)</cost><function><operation><uri>http://edamontology.org/operation_3092</uri><term>Protein feature detection</term></operation><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_1277</uri><term>Protein features</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output><output><data><uri>http://edamontology.org/data_2955</uri><term>Sequence report</term></data><format><uri>http://edamontology.org/format_2333</uri><term>Binary format</term></format></output><note>produces neural network predictions for serine, threonine and tyrosine phosphorylation sites in eukaryotic proteins</note></function><link><url>http://cbs.dtu.dk/services</url><type>Software catalogue</type></link><documentation><url>http://www.cbs.dtu.dk/services/NetPhos/instructions.php</url><type>General</type></documentation><publication><doi>10.1006/jmbi.1999.3310</doi><pmid>10600390</pmid><type>Primary</type></publication><credit><name>CBS</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><url>http://www.bioinformatics.dtu.dk/english/Service/Contact</url><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit><credit><name>Nicolaj Blom</name><email>nblom@kt.dtu.dk</email><orcidid>http://orcid.org/0000-0001-7787-7853</orcidid></credit></tool><tool><name>BigSeqKit</name><description>The Next Generation Sequencing (NGS) raw data are stored in FASTA and FASTQ text-based file formats. Common operations on FASTA/Q files include searching, filtering, sampling, deduplication and sorting, among others. We can find several tools in the literature for FASTA/Q file manipulation but none of them are well fitted for large files of tens of GB (likely TBs in the near future) since mostly they are based on sequential processing. The exception is seqkit that allows some routines to use a few threads but, in any case, the scalability is very limited. To deal with this issue, we introduce BigSeqKit, a parallel toolkit to manipulate FASTA/Q files at scale with speed and scalability at its core. BigSeqKit takes advantage of an HPC-Big Data framework (IgnisHPC) to parallelize and optimize the commands included in seqkit. In this way, in most cases it is from tens to hundreds of times faster than other state-of-the-art tools such as seqkit, samtools and pyfastx.</description><homepage>https://github.com/citiususc/BigSeqKit</homepage><biotoolsID>bigseqkit</biotoolsID><biotoolsCURIE>biotools:bigseqkit</biotoolsCURIE><toolType>Library</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><operatingSystem>Linux</operatingSystem><language>Python</language><license>GPL-3.0</license><function><operation><uri>http://edamontology.org/operation_3192</uri><term>Sequence trimming</term></operation><operation><uri>http://edamontology.org/operation_0372</uri><term>DNA transcription</term></operation><operation><uri>http://edamontology.org/operation_0371</uri><term>DNA translation</term></operation><operation><uri>http://edamontology.org/operation_0233</uri><term>Sequence conversion</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function></tool><tool><name>PLAST</name><description>PLAST is a heuristical method to search for highest scoring local alignments between a DNA sequence query and a graphical pangenome. It takes as input a plain DNA sequence and a pangenome which may either be a set of (multiple) FASTA or FASTQ files or a sequence graph constructed by the tool Bifrost. It then outputs statistically meaningful (gapped) alignments in the style of the NCBI BLAST standard output format. Alignments are calculated based on a "seed-and-extend approach" while traversing the sequence graph. Biologically meaningful alignments are filtered by using an alignment statistic explicitly developed for sequence-to-graph alignments involving graphical pangenomes.</description><homepage>https://github.com/tischulz1/plast</homepage><biotoolsID>pangenome-blast</biotoolsID><biotoolsCURIE>biotools:pangenome-blast</biotoolsCURIE><version>0.0.1-0.2.0</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0797</uri><term>Comparative genomics</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><operatingSystem>Windows</operatingSystem><language>C++</language><license>GPL-3.0</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0495</uri><term>Local alignment</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1211</uri><term>unambiguous pure nucleotide</term></format></input><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_3975</uri><term>GFA 1</term></format><format><uri>http://edamontology.org/format_2333</uri><term>Binary format</term></format></input><output><data><uri>http://edamontology.org/data_1383</uri><term>Nucleic acid sequence alignment</term></data><format><uri>http://edamontology.org/format_1333</uri><term>BLAST results</term></format></output><note>In order to search for alignments within the pangenome graph, 
A pangenome graph used to search for alignments consists of (1) a file in GFA format containing all sequences of the graph, (2) a binary file produced by the tool itself or the software "Bifrost" and (3) a program-specific index data structure in binary format.</note><cmd>PLAST Search -i pangenomeGraphCommonFilePrefix -q fileContainingOneQueryPerLine</cmd></function><function><operation><uri>http://edamontology.org/operation_0227</uri><term>Indexing</term></operation><input><data><uri>http://edamontology.org/data_0850</uri><term>Sequence set</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_0850</uri><term>Sequence set</term></data><format><uri>http://edamontology.org/format_3975</uri><term>GFA 1</term></format><format><uri>http://edamontology.org/format_2333</uri><term>Binary format</term></format></output><note>If a pangenome graph already exists and only an index has to be built, FASTA/FASTQ files are not needed.</note><cmd>PLAST Build -i pangenomeGraphCommonFilePrefix -R *.fasta</cmd></function><link><url>https://gitlab.ub.uni-bielefeld.de/gi/plast</url><type>Repository</type></link><link><url>https://github.com/tischulz1/plast</url><type>Mirror</type></link><documentation><url>https://gitlab.ub.uni-bielefeld.de/gi/plast/-/blob/master/README.md</url><type>General</type></documentation><publication><doi>10.1093/bioinformatics/btab077</doi><pmid>33532821</pmid><pmcid>PMC8388040</pmcid><type>Primary</type><type>Method</type><type>Benchmarking study</type></publication><credit><name>Bielefeld University</name><url>https://www.uni-bielefeld.de/</url><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>Genome Informatics</name><url>https://gi.cebitec.uni-bielefeld.de/</url><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>Tizian Schulz</name><email>plast-service@cebitec.uni-bielefeld.de</email><orcidid>https://orcid.org/0000-0003-0744-7078</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>cuffcompare</name><description>Compare assembled transcripts to a reference annotation and track Cufflinks transcripts across multiple experiments.</description><homepage>http://cole-trapnell-lab.github.io/cufflinks/</homepage><biotoolsID>cuffcompare</biotoolsID><biotoolsCURIE>biotools:cuffcompare</biotoolsCURIE><version>2.2.1.0</version><toolType>Web application</toolType><topic><uri>http://edamontology.org/topic_3168</uri><term>Sequencing</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><language>C++</language><collectionID>Cufflinks</collectionID><collectionID>galaxyPasteur</collectionID><maturity>Mature</maturity><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0492</uri><term>Multiple sequence alignment</term></operation><input><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1975</uri><term>GFF3</term></format><format><uri>http://edamontology.org/format_2306</uri><term>GTF</term></format></input><input><data><uri>http://edamontology.org/data_0863</uri><term>Sequence alignment</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><output><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_2306</uri><term>GTF</term></format></output></function><link><url>https://galaxy.pasteur.fr/tool_runner?tool_id=toolshed.pasteur.fr/repos/fmareuil/cuffcompare/cuffcompare/2.2.1.0</url><type>Galaxy service</type></link><documentation><url>http://cole-trapnell-lab.github.io/cufflinks/cuffcompare/</url><type>General</type></documentation><publication><doi>10.1038/nbt.1621</doi><pmid>20436464</pmid><pmcid>PMC3146043</pmcid><type>Primary</type></publication><publication><doi>10.1093/nar/gkw343</doi><pmid>27137889</pmid><pmcid>PMC4987906</pmcid><type>Other</type></publication><publication><doi>10.7490/f1000research.1114334.1</doi><type>Other</type></publication><credit><name>Galaxy Support Team</name><email>galaxy@pasteur.fr</email><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>fur</name><description>The program fur takes as input a set of target genome sequences and a set of related genome sequences, the neighbors. It returns the sequence regions common to all targets that are absent form the neighbors. Such regions can be used as candidate genetic markers.</description><homepage>https://github.com/evolbioinf/fur</homepage><biotoolsID>fur</biotoolsID><biotoolsCURIE>biotools:fur</biotoolsCURIE><topic><uri>http://edamontology.org/topic_3344</uri><term>Biomedical science</term></topic><language>Go</language><function><operation><uri>http://edamontology.org/operation_2998</uri><term>Nucleic acid comparison</term></operation><input><data><uri>http://edamontology.org/data_2887</uri><term>Nucleic acid sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function></tool><tool><name>metabuli</name><description>Metabuli: specific and sensitive metagenomic classification via joint analysis of DNA and amino acid</description><homepage>https://metabuli.steineggerlab.com</homepage><biotoolsID>metabuli</biotoolsID><biotoolsCURIE>biotools:metabuli</biotoolsCURIE><toolType>Command-line tool</toolType><toolType>Desktop application</toolType><topic><uri>http://edamontology.org/topic_0637</uri><term>Taxonomy</term></topic><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>C++</language><license>GPL-3.0</license><maturity>Mature</maturity><cost>Free of charge</cost><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data></output></function><link><url>https://github.com/steineggerlab/Metabuli/issues</url><type>Issue tracker</type></link><link><url>https://github.com/steineggerlab/Metabuli</url><type>Repository</type><note>github repository</note></link><relation><biotoolsID>mmseqs2</biotoolsID><type>uses</type></relation><relation><biotoolsID>gtdb</biotoolsID><type>uses</type></relation><publication><doi>10.1038/s41592-024-02273-y</doi><pmid>38769467</pmid></publication></tool><tool><name>sourmash</name><description>Compute and compare MinHash signatures for DNA data sets.</description><homepage>https://sourmash.readthedocs.io/en/latest/</homepage><biotoolsID>sourmash</biotoolsID><biotoolsCURIE>biotools:sourmash</biotoolsCURIE><version>2.0a8</version><toolType>Library</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3697</uri><term>Microbial ecology</term></topic><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><license>BSD-3-Clause</license><maturity>Emerging</maturity><cost>Free of charge</cost><function><operation><uri>http://edamontology.org/operation_0346</uri><term>Sequence similarity search</term></operation><operation><uri>http://edamontology.org/operation_0289</uri><term>Sequence distance matrix generation</term></operation><input><data><uri>http://edamontology.org/data_2975</uri><term>Nucleic acid sequence (raw)</term></data><format><uri>http://edamontology.org/format_2200</uri><term>FASTA-like (text)</term></format></input></function><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_3751</uri><term>DSV</term></format></output></function><link><url>https://github.com/sourmash-bio/sourmash</url><type>Repository</type><note>github repository</note></link><relation><biotoolsID>gtdb</biotoolsID><type>uses</type></relation><publication><doi>10.12688/f1000research.19675.1</doi><pmid>31508216</pmid><pmcid>PMC6720031</pmcid></publication><credit><name>C. Titus Brown</name><email>ctbrown@ucdavis.edu</email><url>http://github.com/dib-lab/sourmash/</url><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>Bakta</name><description>Rapid &amp; standardized annotation of bacterial genomes, MAGs &amp; plasmids</description><homepage>https://github.com/oschwengers/bakta</homepage><biotoolsID>bakta</biotoolsID><biotoolsCURIE>biotools:bakta</biotoolsCURIE><version>v1.12.0</version><toolType>Web application</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><topic><uri>http://edamontology.org/topic_0091</uri><term>Bioinformatics</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><language>Python</language><license>GPL-3.0</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Tools</elixirPlatform><elixirNode>Germany</elixirNode><function><operation><uri>http://edamontology.org/operation_0362</uri><term>Genome annotation</term></operation><input><data><uri>http://edamontology.org/data_0925</uri><term>Sequence assembly</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_2914</uri><term>Sequence features metadata</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></input><input><data><uri>http://edamontology.org/data_2012</uri><term>Sequence coordinates</term></data><format><uri>http://edamontology.org/format_1975</uri><term>GFF3</term></format></input><input><data><uri>http://edamontology.org/data_2886</uri><term>Protein sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_1364</uri><term>Hidden Markov model</term></data><format><uri>http://edamontology.org/format_3329</uri><term>HMMER3</term></format></input><output><data><uri>http://edamontology.org/data_1270</uri><term>Feature table</term></data><format><uri>http://edamontology.org/format_1936</uri><term>GenBank format</term></format><format><uri>http://edamontology.org/format_1975</uri><term>GFF3</term></format><format><uri>http://edamontology.org/format_3464</uri><term>JSON</term></format><format><uri>http://edamontology.org/format_1927</uri><term>EMBL format</term></format><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><output><data><uri>http://edamontology.org/data_2886</uri><term>Protein sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><output><data><uri>http://edamontology.org/data_2887</uri><term>Nucleic acid sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><output><data><uri>http://edamontology.org/data_2884</uri><term>Plot</term></data><format><uri>http://edamontology.org/format_3604</uri><term>SVG</term></format><format><uri>http://edamontology.org/format_3603</uri><term>PNG</term></format></output><output><data><uri>http://edamontology.org/data_1772</uri><term>Score</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><cmd>bakta --db &lt;db-path&gt; --prefix &lt;prefix&gt; --output &lt;output-path&gt; genome.fasta</cmd></function><link><url>https://github.com/oschwengers/bakta</url><type>Repository</type></link><link><url>https://github.com/oschwengers/bakta/issues</url><type>Issue tracker</type></link><link><url>https://bioconda.github.io/recipes/bakta/README.html</url><type>Other</type></link><link><url>https://bakta.computational.bio</url><type>Service</type></link><download><url>https://zenodo.org/records/14916843</url><type>Other</type><note>Mandatory annotation database</note><version>v6.0</version></download><documentation><url>https://github.com/oschwengers/bakta/blob/main/README.md</url><type>General</type></documentation><documentation><url>https://github.com/oschwengers/bakta/blob/main/CONTRIBUTION.md</url><type>Contributions policy</type></documentation><documentation><url>https://github.com/oschwengers/bakta/blob/main/CODE_OF_CONDUCT.md</url><type>Code of conduct</type></documentation><documentation><url>https://bakta.readthedocs.io/</url><type>User manual</type></documentation><relation><biotoolsID>diamond</biotoolsID><type>uses</type></relation><relation><biotoolsID>hmmer3</biotoolsID><type>uses</type></relation><relation><biotoolsID>infernal</biotoolsID><type>uses</type></relation><relation><biotoolsID>trnascan-se</biotoolsID><type>uses</type></relation><relation><biotoolsID>blast</biotoolsID><type>uses</type></relation><relation><biotoolsID>aragorn</biotoolsID><type>uses</type></relation><relation><biotoolsID>pilercr</biotoolsID><type>uses</type></relation><relation><biotoolsID>deepsig</biotoolsID><type>uses</type></relation><publication><doi>10.1099/mgen.0.000685</doi><pmid>34739369</pmid><pmcid>PMC8743544</pmcid><type>Primary</type><version>1.1</version></publication><publication><doi>10.1093/nar/gkaf335</doi><pmid>40271661</pmid><pmcid>PMC12230652</pmcid><type>Primary</type></publication><credit><name>Oliver Schwengers</name><email>oliver.schwengers@cb.jlug.de</email><url>https://github.com/oschwengers</url><orcidid>https://orcid.org/0000-0003-4216-2721</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Developer</typeRole><typeRole>Maintainer</typeRole></credit><credit><name>Justus Liebig University Giessen</name><url>https://www.uni-giessen.de</url><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit></tool><tool><name>MetaPhlAn</name><description>Computational tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data.</description><homepage>http://segatalab.cibio.unitn.it/tools/metaphlan/index.html</homepage><biotoolsID>metaphlan</biotoolsID><biotoolsCURIE>biotools:metaphlan</biotoolsCURIE><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><topic><uri>http://edamontology.org/topic_0194</uri><term>Phylogenomics</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><operatingSystem>Windows</operatingSystem><language>Python</language><license>MIT</license><collectionID>Animal and Crop Genomics</collectionID><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_2573</uri><term>SAM</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_3751</uri><term>DSV</term></format></output><cmd>metaphlan &lt;fastq_input&gt; --input_type fastq -o &lt;output&gt;</cmd></function><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_3751</uri><term>DSV</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_3751</uri><term>DSV</term></format></output><note>Convert SGB-based profile to GTDB taxonomy</note><cmd>sgb_to_gtdb_profile.py -i &lt;metaphlan_output&gt; -o &lt;gtdb_metaphlan_output&gt;</cmd></function><documentation><url>https://github.com/biobakery/MetaPhlAn</url><type>General</type></documentation><relation><biotoolsID>gtdb</biotoolsID><type>uses</type></relation><publication><doi>10.1038/nmeth.2066</doi><pmid>22688413</pmid><pmcid>PMC3443552</pmcid><type>Primary</type></publication><credit><url>https://groups.google.com/forum/#!forum/metaphlan-users</url><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>kraken2</name><description>Kraken 2 is the newest version of Kraken, a taxonomic classification system using exact k-mer matches to achieve high accuracy and fast classification speeds. This classifier matches each k-mer within a query sequence to the lowest common ancestor (LCA) of all genomes containing the given k-mer. The k-mer assignments inform the classification algorithm.
Any assumption that Kraken&#8217;s raw read assignments can be directly translated into species or strain-level abundance estimates is flawed. Bracken (Bayesian Reestimation of Abundance after Classification with KrakEN), estimates species abundances in metagenomics samples by probabilistically re-distributing reads in the taxonomic tree. (Lu, Jennifer et al. &#8220;Bracken: estimating species abundance in metagenomics data.&#8221;)</description><homepage>https://github.com/DerrickWood/kraken2</homepage><biotoolsID>kraken2</biotoolsID><biotoolsCURIE>biotools:kraken2</biotoolsCURIE><version>2.0.8-beta</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0637</uri><term>Taxonomy</term></topic><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><topic><uri>http://edamontology.org/topic_3697</uri><term>Microbial ecology</term></topic><topic><uri>http://edamontology.org/topic_3301</uri><term>Microbiology</term></topic><language>Perl</language><language>C++</language><license>MIT</license><collectionID>ONTeater</collectionID><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><cmd>`kraken2 --db &lt;kraken2_database&gt; &lt;input.fastq&gt;`</cmd></function><link><url>https://github.com/DerrickWood/kraken2</url><type>Repository</type></link><link><url>https://github.com/DerrickWood/kraken2/issues</url><type>Issue tracker</type></link><download><url>https://github.com/DerrickWood/kraken2/archive/v2.0.8-beta.tar.gz</url><type>Source code</type><version>2.0.8-beta</version></download><documentation><url>https://github.com/DerrickWood/kraken2/wiki/Manual</url><type>User manual</type></documentation><documentation><url>https://benlangmead.github.io/aws-indexes/k2</url><type>User manual</type><note>Links to multiple Kraken 2 and bracken databases and indexes</note></documentation><relation><biotoolsID>kraken</biotoolsID><type>isNewVersionOf</type></relation><relation><biotoolsID>bracken</biotoolsID><type>usedBy</type></relation><relation><biotoolsID>gtdb</biotoolsID><type>uses</type></relation><publication><doi>10.1101/762302</doi></publication><credit><name>Derrick E. Wood</name><orcidid>http://orcid.org/0000-0002-7429-1854</orcidid><typeEntity>Person</typeEntity></credit><credit><name>Jennifer Lu</name><orcidid>http://orcid.org/0000-0001-9167-2002</orcidid><typeEntity>Person</typeEntity></credit><credit><name>Ben Langmead</name><email>langmea@cs.jhu.edu</email><orcidid>http://orcid.org/0000-0003-2437-1976</orcidid><typeEntity>Person</typeEntity></credit></tool><tool><name>SingleM</name><description>Novelty-inclusive microbial community profiling of shotgun metagenomes</description><homepage>https://wwood.github.io/singlem/</homepage><biotoolsID>singlem</biotoolsID><biotoolsCURIE>biotools:singlem</biotoolsCURIE><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><operatingSystem>Linux</operatingSystem><language>Python</language><license>GPL-3.0</license><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format></output></function><documentation><url>https://wwood.github.io/singlem/</url><type>General</type><type>Command-line options</type><type>Installation instructions</type></documentation><relation><biotoolsID>gtdb</biotoolsID><type>uses</type></relation><relation><biotoolsID>globdb</biotoolsID><type>uses</type></relation><publication><doi>10.1101/2024.01.30.578060</doi></publication></tool><tool><name>FastOMA</name><description>FastOMA is a scalable software package to infer orthology relationship.</description><homepage>https://github.com/dessimozlab/FastOMA</homepage><biotoolsID>fastoma</biotoolsID><biotoolsCURIE>biotools:fastoma</biotoolsCURIE><version>0.5.1</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0797</uri><term>Comparative genomics</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>Python</language><license>MPL-2.0</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirNode>Switzerland</elixirNode><function><operation><uri>http://edamontology.org/operation_3947</uri><term>Phylogenetic tree reconciliation</term></operation><operation><uri>http://edamontology.org/operation_2479</uri><term>Protein sequence analysis</term></operation><operation><uri>http://edamontology.org/operation_3209</uri><term>Genome comparison</term></operation><operation><uri>http://edamontology.org/operation_0291</uri><term>Sequence clustering</term></operation><operation><uri>http://edamontology.org/operation_0540</uri><term>Phylogenetic inference (from molecular sequences)</term></operation><input><data><uri>http://edamontology.org/data_2886</uri><term>Protein sequence record</term></data><format><uri>http://edamontology.org/format_2200</uri><term>FASTA-like (text)</term></format></input><input><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_1910</uri><term>newick</term></format></input><output><data><uri>http://edamontology.org/data_1245</uri><term>Sequence cluster (protein)</term></data></output><output><data><uri>http://edamontology.org/data_3148</uri><term>Gene family report</term></data><format><uri>http://edamontology.org/format_3850</uri><term>OrthoXML</term></format></output></function><link><url>https://github.com/dessimozlab/FastOMA</url><type>Repository</type></link><link><url>https://omabrowser.org</url><type>Service</type></link><documentation><url>https://github.com/DessimozLab/FastOMA/blob/main/README.md</url><type>Quick start guide</type></documentation><relation><biotoolsID>omamer</biotoolsID><type>uses</type></relation><publication><doi>10.1038/s41592-024-02552-8</doi><pmid>39753922</pmid><pmcid>PMC11810774</pmcid></publication></tool><tool><name>globalamfungi</name><description>The GlobalAMFungi Database is an open-access, global repository of arbuscular mycorrhizal (AM) fungal occurrence data compiled from high-throughput sequencing studies. It aggregates millions of DNA sequence observations of AM fungi from thousands of geographically referenced environmental samples, together with rich metadata on sampling location and environmental context, to map the global distribution and diversity of these key plant symbionts. The resource is designed to be community-driven and FAIR (Findable, Accessible, Interoperable, Reusable), supporting research into AM fungal ecology, biogeography, and how environmental factors shape their diversity.</description><homepage>https://globalamfungi.com/</homepage><biotoolsID>globalamfungi</biotoolsID><biotoolsCURIE>biotools:globalamfungi</biotoolsCURIE><toolType>Database portal</toolType><toolType>Web application</toolType><topic><uri>http://edamontology.org/topic_3050</uri><term>Biodiversity</term></topic><topic><uri>http://edamontology.org/topic_3697</uri><term>Microbial ecology</term></topic><topic><uri>http://edamontology.org/topic_3810</uri><term>Agricultural science</term></topic><language>R</language><license>CC-BY-4.0</license><collectionID>ELIXIR-CZ</collectionID><collectionID>Czech Republic</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Data</elixirPlatform><elixirCommunity>Plant Sciences</elixirCommunity><elixirNode>Czech Republic</elixirNode><function><operation><uri>http://edamontology.org/operation_3431</uri><term>Data deposition</term></operation><operation><uri>http://edamontology.org/operation_0226</uri><term>Annotation</term></operation><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0857</uri><term>Sequence search results</term></data><format><uri>http://edamontology.org/format_2310</uri><term>FASTA-HTML</term></format></output></function><link><url>https://globalamfungi.com/</url><type>Helpdesk</type><type>Repository</type><type>Service</type></link><documentation><url>https://globalamfungi.com/</url><type>User manual</type><type>Citation instructions</type><type>Quick start guide</type><type>Contributions policy</type><type>General</type></documentation><publication><doi>10.1111/nph.19283</doi><type>Primary</type></publication><credit><name>Tomas Vetrovsky</name><email>vetrovsky@biomed.cas.cz</email><orcidid>https://orcid.org/0000-0002-0831-486X</orcidid><rorid>02p1jz666</rorid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Developer</typeRole><typeRole>Contributor</typeRole></credit><credit><name>ELIXIR-CZ</name><typeEntity>Consortium</typeEntity></credit></tool><tool><name>ASAFind</name><description>ASAFind is a software that predicts the intracellular location of proteins in cells with four membrane-bound complex plastids of red algal origin. 
These plastids evolved via eukaryote-eukaryote endosymbiosis and for example can be found in diatoms and cryptophytes. ASAFind uses protein sequence data and external predictions of signal peptides (by the tools SignalP and TargetP), and predicts plastid proteins, and proteins that are targeted to the periplastidic compartment from these data; optionally, a graphical output can generated.</description><homepage>https://asafind.jcu.cz/</homepage><biotoolsID>asafind</biotoolsID><biotoolsCURIE>biotools:asafind</biotoolsCURIE><version>2.0</version><toolType>Script</toolType><toolType>Command-line tool</toolType><toolType>Web service</toolType><topic><uri>http://edamontology.org/topic_2229</uri><term>Cell biology</term></topic><topic><uri>http://edamontology.org/topic_0078</uri><term>Proteins</term></topic><topic><uri>http://edamontology.org/topic_0780</uri><term>Plant biology</term></topic><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>Python</language><license>CC-BY-SA-4.0</license><collectionID>ELIXIR-CZ</collectionID><collectionID>Czech Republic</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Tools</elixirPlatform><elixirCommunity>Plant Sciences</elixirCommunity><elixirCommunity>Marine Metagenomics</elixirCommunity><elixirCommunity>Microbial Biotechnology</elixirCommunity><elixirNode>Czech Republic</elixirNode><function><operation><uri>http://edamontology.org/operation_2489</uri><term>Subcellular localisation prediction</term></operation><operation><uri>http://edamontology.org/operation_0239</uri><term>Sequence motif recognition</term></operation><operation><uri>http://edamontology.org/operation_0422</uri><term>Protein cleavage site prediction</term></operation><input><data><uri>http://edamontology.org/data_2886</uri><term>Protein sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_1270</uri><term>Feature table</term></data></input><output><data><uri>http://edamontology.org/data_1277</uri><term>Protein features</term></data></output><output><data><uri>http://edamontology.org/data_2884</uri><term>Plot</term></data></output></function><link><url>https://asafind.jcu.cz</url><type>Service</type><note>Web Service</note></link><link><url>https://github.com/ASAFind/ASAFind-2</url><type>Repository</type><note>Information and source code, for local installation or development</note></link><download><url>https://asafind.jcu.cz/download-page/</url><type>Downloads page</type><note>Download page on web-service, links to repository</note><version>2.0</version></download><documentation><url>https://asafind.jcu.cz/download-page/</url><type>Installation instructions</type></documentation><relation><biotoolsID>signalp</biotoolsID><type>uses</type></relation><relation><biotoolsID>targetp</biotoolsID><type>uses</type></relation><publication><doi>10.1111/tpj.70138</doi><pmid>40464854</pmid><pmcid>PMC12136025</pmcid><type>Primary</type><version>2.0</version><note>Publication of the current version of ASAFind (2.0)</note></publication><publication><doi>10.1111/tpj.12734</doi><pmid>25438865</pmid><pmcid>PMC4329603</pmcid><type>Other</type><version>1.0</version><note>Publication of the first version of ASAFind</note></publication><publication><doi>10.48550/arXiv.2303.02509</doi><type>Benchmarking study</type><version>1.0</version><note>Benchmarking of the performance of the first version of ASAFind</note></publication><credit><name>Marta Vohnoutov&#225;</name><email>mvohnoutova@jcu.cz</email><url>https://www.jcu.cz/cz/univerzita/lide/clovek?identita=Vohnoutova_Marta_61699</url><orcidid>https://orcid.org/0000-0002-8915-8626</orcidid><typeEntity>Person</typeEntity><typeRole>Contributor</typeRole><typeRole>Developer</typeRole><typeRole>Maintainer</typeRole></credit><credit><name>Ansgar Gruber</name><email>agruber@prf.jcu.cz</email><url>https://www.jcu.cz/cz/univerzita/lide/clovek?identita=Gruber_Ansgar_118401</url><orcidid>https://orcid.org/0000-0002-5876-4391</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Developer</typeRole><typeRole>Documentor</typeRole></credit></tool><tool><name>metagWGS</name><description>metagWGS is a workflow dedicated to the analysis of metagenomic data. It allows assembly, taxonomic annotation, and functional annotation of predicted genes. Since release 2.3, binning step with the possibility of cross-alignment is included. It has been developed in collaboration with several CATI BIOS4biol agents. Funded by Antiselfish Project (Labex Ecofect), ExpoMicoPig project (France Futur elevage) and SeqOccIn project (CPER - Occitanie Toulouse / FEDER), ATB_Biofilm funded by PNREST Anses, France genomique (ANR-10-INBS-09-08) and Resalab Ouest.</description><homepage>https://forgemia.inra.fr/genotoul-bioinfo/metagwgs</homepage><biotoolsID>metagwgs</biotoolsID><biotoolsCURIE>biotools:metagwgs</biotoolsCURIE><version>2.3</version><version>2.5.0</version><toolType>Workflow</toolType><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><operatingSystem>Linux</operatingSystem><language>Python</language><license>GPL-3.0</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirNode>France</elixirNode><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_1915</uri><term>Format</term></format></output></function><function><operation><uri>http://edamontology.org/operation_3672</uri><term>Gene functional annotation</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3917</uri><term>Count matrix</term></data><format><uri>http://edamontology.org/format_1915</uri><term>Format</term></format></output></function><function><operation><uri>http://edamontology.org/operation_0524</uri><term>De-novo assembly</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function><link><url>https://forge.inrae.fr/genotoul-bioinfo/metagwgs</url><type>Repository</type><note>See documentation, source code and functionnal test documentation.</note></link><download><url>https://forgemia.inra.fr/genotoul-bioinfo/metagwgs-test-datasets</url><type>Test data</type><note>Functional tests data and script</note></download><documentation><url>https://genotoul-bioinfo.pages-forge.inrae.fr/metagwgs/master/index.html</url><type>User manual</type><note>We provide up to date installation documentation, usage documentation, output description and functionnal test datasets and procedure.</note></documentation><credit><name>Claire Hoede</name><email>claire.hoede@inrae.fr</email><orcidid>https://orcid.org/0000-0001-5054-7731</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit><credit><name>GenoToul bioinformatics facility</name><url>http://bioinfo.genotoul.fr/</url><typeEntity>Division</typeEntity><typeRole>Provider</typeRole></credit></tool><tool><name>Jalview</name><description>Jalview is a free cross-platform program for multiple sequence alignment editing, visualisation and analysis. Use it to align, view and edit sequence alignments, analyse them with phylogenetic trees and principal components analysis (PCA) plots and explore molecular structures and annotation.</description><homepage>https://www.jalview.org/</homepage><biotoolsID>Jalview</biotoolsID><biotoolsCURIE>biotools:Jalview</biotoolsCURIE><version>2.11.5.1</version><toolType>Desktop application</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><topic><uri>http://edamontology.org/topic_0092</uri><term>Data visualisation</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><license>GPL-3.0</license><collectionID>ELIXIR-UK</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Tools</elixirPlatform><elixirPlatform>Interoperability</elixirPlatform><elixirNode>UK</elixirNode><function><operation><uri>http://edamontology.org/operation_3081</uri><term>Sequence alignment editing</term></operation><operation><uri>http://edamontology.org/operation_0324</uri><term>Phylogenetic tree analysis</term></operation><operation><uri>http://edamontology.org/operation_0564</uri><term>Sequence visualisation</term></operation><input><data><uri>http://edamontology.org/data_0863</uri><term>Sequence alignment</term></data><format><uri>http://edamontology.org/format_1948</uri><term>nbrf/pir</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format><format><uri>http://edamontology.org/format_3015</uri><term>Pileup</term></format><format><uri>http://edamontology.org/format_1938</uri><term>GFF2-seq</term></format><format><uri>http://edamontology.org/format_3313</uri><term>BLC</term></format><format><uri>http://edamontology.org/format_3311</uri><term>RNAML</term></format><format><uri>http://edamontology.org/format_1915</uri><term>Format</term></format><format><uri>http://edamontology.org/format_3774</uri><term>BioJSON (Jalview)</term></format><format><uri>http://edamontology.org/format_1982</uri><term>ClustalW format</term></format><format><uri>http://edamontology.org/format_1477</uri><term>mmCIF</term></format><format><uri>http://edamontology.org/format_1997</uri><term>PHYLIP format</term></format><format><uri>http://edamontology.org/format_3016</uri><term>VCF</term></format><format><uri>http://edamontology.org/format_1961</uri><term>Stockholm format</term></format><format><uri>http://edamontology.org/format_1939</uri><term>GFF3-seq</term></format><format><uri>http://edamontology.org/format_1947</uri><term>GCG MSF</term></format></input><input><data><uri>http://edamontology.org/data_0886</uri><term>Structure alignment</term></data><format><uri>http://edamontology.org/format_1476</uri><term>PDB</term></format></input><output><data><uri>http://edamontology.org/data_0863</uri><term>Sequence alignment</term></data><format><uri>http://edamontology.org/format_1948</uri><term>nbrf/pir</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_3015</uri><term>Pileup</term></format><format><uri>http://edamontology.org/format_3313</uri><term>BLC</term></format><format><uri>http://edamontology.org/format_3774</uri><term>BioJSON (Jalview)</term></format><format><uri>http://edamontology.org/format_1982</uri><term>ClustalW format</term></format><format><uri>http://edamontology.org/format_3464</uri><term>JSON</term></format><format><uri>http://edamontology.org/format_1997</uri><term>PHYLIP format</term></format><format><uri>http://edamontology.org/format_1961</uri><term>Stockholm format</term></format><format><uri>http://edamontology.org/format_1947</uri><term>GCG MSF</term></format></output><output><data><uri>http://edamontology.org/data_2884</uri><term>Plot</term></data><format><uri>http://edamontology.org/format_3466</uri><term>EPS</term></format><format><uri>http://edamontology.org/format_1915</uri><term>Format</term></format><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format><format><uri>http://edamontology.org/format_3604</uri><term>SVG</term></format><format><uri>http://edamontology.org/format_3603</uri><term>PNG</term></format></output><note>Other Input formats:
AMSA (.amsa);
JnetFile (.concise, .jnet);
PFAM (.pfam);
Substitution matrix (.matrix);
Jalview Project File (.jvp);
Jalview Feature File (.features, .jvfeatures);
Jalview Annotations File (.annotations, .jvannotations);
Predicted Aligned Error (PAE) Matrix File (.json)
...
Other Output formats:
PFAM (.pfam);
BioJS (.biojs) (interactive HTML/Javascript);
Jalview Project File (.jvp);</note></function><link><url>https://discourse.jalview.org/</url><type>Discussion forum</type></link><link><url>https://issues.jalview.org/</url><type>Issue tracker</type></link><link><url>https://www.jalview.org/development/jalview_develop/</url><type>Other</type><note>Latest development version</note></link><link><url>https://gitlab.jalview.org/jalview/jalview.git</url><type>Repository</type></link><link><url>https://twitter.com/Jalview</url><type>Social media</type><note>Twitter feed</note></link><link><url>https://www.youtube.com/channel/UCIjpnvZB770yz7ftbrJ0tfw</url><type>Social media</type><note>YouTube training videos</note></link><link><url>https://www.linkedin.com/company/104771886/</url><type>Social media</type></link><download><url>https://www.jalview.org/download</url><type>Downloads page</type></download><download><url>https://www.jalview.org/download/source/</url><type>Source code</type></download><download><url>https://www.jalview.org/download/?os=all</url><type>Binaries</type><note>Binaries for all platforms</note></download><download><url>https://www.jalview.org/favicon.svg</url><type>Icon</type></download><download><url>https://www.jalview.org/download/other/jar/</url><type>Binaries</type><note>Executable JAR file</note></download><documentation><url>https://www.jalview.org/about/citation</url><type>Citation instructions</type></documentation><documentation><url>https://www.jalview.org/training/</url><type>Training material</type><note>Hands-on exercises, Training courses and Training videos</note></documentation><documentation><url>https://www.jalview.org/help/faq</url><type>FAQ</type></documentation><documentation><url>https://www.jalview.org/help/documentation/</url><type>User manual</type></documentation><relation><biotoolsID>jabaws</biotoolsID><type>uses</type></relation><relation><biotoolsID>chimera</biotoolsID><type>uses</type></relation><relation><biotoolsID>chimerax</biotoolsID><type>uses</type></relation><relation><biotoolsID>pymol</biotoolsID><type>uses</type></relation><relation><biotoolsID>bioconda</biotoolsID><type>includedIn</type></relation><relation><biotoolsID>3d-beacons</biotoolsID><type>uses</type></relation><relation><biotoolsID>uniprot</biotoolsID><type>uses</type></relation><relation><biotoolsID>pfam</biotoolsID><type>uses</type></relation><relation><biotoolsID>ensembl</biotoolsID><type>uses</type></relation><relation><biotoolsID>pdb</biotoolsID><type>uses</type></relation><relation><biotoolsID>rfam</biotoolsID><type>uses</type></relation><relation><biotoolsID>3d-beacons</biotoolsID><type>uses</type></relation><publication><doi>10.1093/bioinformatics/btp033</doi><pmid>19151095</pmid><pmcid>PMC2672624</pmcid></publication><credit><name>Jim Procter</name><url>http://www.lifesci.dundee.ac.uk/people/jim-procter</url><orcidid>https://orcid.org/0000-0002-7865-7382</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit><credit><name>Geoff Barton</name><url>https://www.lifesci.dundee.ac.uk/people/geoff-barton</url><orcidid>https://orcid.org/0000-0002-9014-5355</orcidid></credit></tool><tool><name>emapper2gbk</name><description>creation of genbank files from Eggnog-mapper annotation outputs</description><homepage>https://github.com/AuReMe/emapper2gbk</homepage><biotoolsID>emapper2gbk</biotoolsID><biotoolsCURIE>biotools:emapper2gbk</biotoolsCURIE><version>0.3.2</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_4011</uri><term>Data rescue</term></topic><operatingSystem>Linux</operatingSystem><language>Python</language><license>LGPL-3.0</license><collectionID>AuReMe</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3434</uri><term>Conversion</term></operation><input><data><uri>http://edamontology.org/data_0925</uri><term>Sequence assembly</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_1233</uri><term>Sequence set (protein)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_2012</uri><term>Sequence coordinates</term></data><format><uri>http://edamontology.org/format_2305</uri><term>GFF</term></format></input><output><data><uri>http://edamontology.org/data_1270</uri><term>Feature table</term></data><format><uri>http://edamontology.org/format_2205</uri><term>GenBank-like format (text)</term></format></output><note>Genomic -single mode</note><cmd>genomes -fn genome.fna -fp proteome.faa -gff genome.gff -n "Escherichia coli"</cmd></function><link><url>https://github.com/AuReMe/emapper2gbk</url><type>Repository</type></link><link><url>https://pypi.org/project/emapper2gbk/</url><type>Software catalogue</type><note>Access to Python library</note></link><download><url>https://github.com/AuReMe/emapper2gbk.git</url><type>Source code</type><version>0.3.2</version></download><download><url>https://pypi.org/project/emapper2gbk/#files</url><type>Software package</type><version>0.3.2</version></download><documentation><url>https://github.com/AuReMe/emapper2gbk</url><type>Installation instructions</type><type>User manual</type></documentation><credit><name>Arnaud</name><url>https://github.com/ArnaudBelcour</url><orcidid>https://orcid.org/0000-0003-1170-0785</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>South Green Genome Hubs</name><description>The South Green Genome Hub is a suite of crop-specific community portals to manage genomic datasets with focus on tropical and Mediterranean plants. Currently developed on Banana, Cacao, Coffee, Grass, Rice, Vanilla, Citrus and Sugarcane, genome hubs provide access to multiple datasets (e.g. assemblies, gene product information, metabolic pathways, gene families, transcriptomics and genetic markers).</description><homepage>https://banana-genome-hub.southgreen.fr/</homepage><biotoolsID>South_Green_Genome_Hubs</biotoolsID><biotoolsCURIE>biotools:South_Green_Genome_Hubs</biotoolsCURIE><toolType>Database portal</toolType><topic><uri>http://edamontology.org/topic_3810</uri><term>Agricultural science</term></topic><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><topic><uri>http://edamontology.org/topic_3308</uri><term>Transcriptomics</term></topic><topic><uri>http://edamontology.org/topic_0797</uri><term>Comparative genomics</term></topic><topic><uri>http://edamontology.org/topic_0621</uri><term>Model organisms</term></topic><language>JavaScript</language><language>PHP</language><license>GPL-2.0</license><collectionID>Genome Hub</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><elixirPlatform>Data</elixirPlatform><elixirPlatform>Tools</elixirPlatform><elixirNode>France</elixirNode><function><operation><uri>http://edamontology.org/operation_3208</uri><term>Genome visualisation</term></operation><operation><uri>http://edamontology.org/operation_0338</uri><term>Sequence database search</term></operation><operation><uri>http://edamontology.org/operation_2403</uri><term>Sequence analysis</term></operation><input><data><uri>http://edamontology.org/data_0968</uri><term>Keyword</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data></output></function><link><url>https://github.com/SouthGreenPlatform/genomehubs</url><type>Issue tracker</type></link><documentation><url>https://banana-genome-hub.southgreen.fr/content/term-service</url><type>Terms of use</type></documentation><relation><biotoolsID>tripal</biotoolsID><type>uses</type></relation><relation><biotoolsID>banana_genome_hub</biotoolsID><type>includes</type></relation><relation><biotoolsID>coffee_genome_hub</biotoolsID><type>includes</type></relation><relation><biotoolsID>cocoa_genome_hub</biotoolsID><type>includes</type></relation><relation><biotoolsID>rice_genome_hub</biotoolsID><type>includes</type></relation><relation><biotoolsID>sugarcane_genome_hub</biotoolsID><type>includes</type></relation><relation><biotoolsID>southgreen</biotoolsID><type>includedIn</type></relation><publication><doi>10.1093/database/bat035</doi><pmid>23707967</pmid><pmcid>PMC3662865</pmcid><type>Primary</type></publication><publication><doi>10.1093/nar/gku1108</doi><pmid>25392413</pmid><pmcid>PMC4383925</pmcid><type>Primary</type></publication><publication><doi>10.1093/hr/uhac221</doi><pmid>36479579</pmid><pmcid>PMC9720444</pmcid><type>Primary</type></publication><publication><doi>10.1016/j.xplc.2022.100330</doi><pmid>35617961</pmid><pmcid>PMC9482989</pmcid><type>Usage</type></publication><credit><name>Gaetan Droc</name><email>droc@cirad.fr</email><orcidid>https://orcid.org/0000-0003-1849-1269</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Developer</typeRole><typeRole>Documentor</typeRole><typeRole>Provider</typeRole><typeRole>Support</typeRole></credit><credit><name>Alexis Dereeper</name><email>alexis.dereeper@ird.fr</email></credit><credit><name>Mathieu Rouard</name><email>m.rouard@cgiar.org</email><orcidid>https://orcid.org/0000-0003-1849-1269</orcidid><typeRole>Primary contact</typeRole><typeRole>Maintainer</typeRole><typeRole>Support</typeRole></credit><credit><name>Valentin Guignon</name><email>v.guignon@cgiar.org</email></credit><credit><name>Chrisitine Tranchant</name><email>christine.tranchant@ird.fr</email></credit><credit><name>St&#233;phanie Bocs</name><email>stephanie.sidibe-bocs@cirad.fr</email></credit></tool><tool><name>trimAl</name><description>Tool for the automated removal of spurious sequences or poorly aligned regions from a multiple sequence alignment.</description><homepage>https://trimal.readthedocs.io</homepage><biotoolsID>trimal</biotoolsID><biotoolsCURIE>biotools:trimal</biotoolsCURIE><version>1.5.1</version><version>2.0-RC</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><topic><uri>http://edamontology.org/topic_3168</uri><term>Sequencing</term></topic><topic><uri>http://edamontology.org/topic_0160</uri><term>Sequence sites, features and motifs</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Linux</operatingSystem><language>C++</language><maturity>Mature</maturity><cost>Free of charge</cost><function><operation><uri>http://edamontology.org/operation_0492</uri><term>Multiple sequence alignment</term></operation><input><data><uri>http://edamontology.org/data_0863</uri><term>Sequence alignment</term></data></input><output><data><uri>http://edamontology.org/data_1916</uri><term>Alignment</term></data><format><uri>http://edamontology.org/format_1998</uri><term>phylipnon</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1982</uri><term>ClustalW format</term></format><format><uri>http://edamontology.org/format_1997</uri><term>PHYLIP format</term></format></output><output><data><uri>http://edamontology.org/data_2048</uri><term>Report</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output></function><link><url>https://github.com/inab/trimal</url><type>Repository</type></link><download><url>https://github.com/inab/trimal</url><type>Source code</type><version>1.5.1</version></download><download><url>https://github.com/inab/trimal/tree/2.0_RC</url><type>Source code</type><version>2.0-RC</version></download><documentation><url>http://trimal.cgenomics.org/</url><type>General</type><note>Old versions</note></documentation><documentation><url>https://trimal.readthedocs.io</url><type>General</type></documentation><publication><doi>10.1093/bioinformatics/btp348</doi><pmid>19505945</pmid><pmcid>PMC2712344</pmcid></publication><credit><name>Salvador Capella-Gutierrez</name><email>salcagu@gmail.com</email><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit><credit><email>tgabaldon@crg.es</email><url>http://gabaldonlab.crg.es/</url><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>MMseqs2</name><description>MMseqs2 (Many-against-Many sequence searching) is a software suite to search and cluster huge protein and nucleotide sequence sets. MMseqs2 is open source software implemented in C++ for Linux, MacOS, and (as beta version, via cygwin) Windows. The software is designed to run on multiple cores and servers and exhibits very good scalability. MMseqs2 can run 10000 times faster than BLAST. At 100 times its speed it achieves almost the same sensitivity. It can perform profile searches with the same sensitivity as PSI-BLAST at over 400 times its speed.

MMseqs2 includes Linclust, the first clustering algorithm whose runtime scales linearly With Linclust we clustered 1.6 billion metagenomic sequence fragments in 10&#8201;h on a single server to 50% sequence identity.</description><homepage>https://mmseqs.com</homepage><biotoolsID>MMseqs2</biotoolsID><biotoolsCURIE>biotools:MMseqs2</biotoolsCURIE><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><topic><uri>http://edamontology.org/topic_0078</uri><term>Proteins</term></topic><topic><uri>http://edamontology.org/topic_0077</uri><term>Nucleic acids</term></topic><topic><uri>http://edamontology.org/topic_0623</uri><term>Gene and protein families</term></topic><topic><uri>http://edamontology.org/topic_0637</uri><term>Taxonomy</term></topic><operatingSystem>Windows</operatingSystem><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><language>C++</language><license>MIT</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0346</uri><term>Sequence similarity search</term></operation><operation><uri>http://edamontology.org/operation_0292</uri><term>Sequence alignment</term></operation><input><data><uri>http://edamontology.org/data_0850</uri><term>Sequence set</term></data><format><uri>http://edamontology.org/format_2182</uri><term>FASTQ-like format (text)</term></format><format><uri>http://edamontology.org/format_2200</uri><term>FASTA-like (text)</term></format></input><output><data><uri>http://edamontology.org/data_0857</uri><term>Sequence search results</term></data><format><uri>http://edamontology.org/format_1333</uri><term>BLAST results</term></format></output><cmd>mmseqs easy-search query.fasta target.fasta output.m8 tmp</cmd></function><function><operation><uri>http://edamontology.org/operation_0291</uri><term>Sequence clustering</term></operation><input><data><uri>http://edamontology.org/data_0850</uri><term>Sequence set</term></data><format><uri>http://edamontology.org/format_2182</uri><term>FASTQ-like format (text)</term></format><format><uri>http://edamontology.org/format_2200</uri><term>FASTA-like (text)</term></format></input><output><data><uri>http://edamontology.org/data_1235</uri><term>Sequence cluster</term></data></output><cmd>mmseqs easy-cluster input.fasta output tmp
mmseqs easy-linclust input.fasta output tmp</cmd></function><function><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><input><data><uri>http://edamontology.org/data_0850</uri><term>Sequence set</term></data><format><uri>http://edamontology.org/format_2182</uri><term>FASTQ-like format (text)</term></format><format><uri>http://edamontology.org/format_2200</uri><term>FASTA-like (text)</term></format></input><output><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data></output><cmd>mmseqs easy-taxonomy query.fasta targetDB output tmp</cmd></function><link><url>https://github.com/soedinglab/mmseqs2/issues</url><type>Issue tracker</type></link><link><url>https://github.com/soedinglab/mmseqs2</url><type>Repository</type></link><download><url>https://github.com/soedinglab/MMseqs2/releases</url><type>Source code</type></download><documentation><url>https://github.com/soedinglab/MMseqs2/blob/master/README.md</url><type>General</type></documentation><documentation><url>https://github.com/soedinglab/mmseqs2/wiki</url><type>User manual</type></documentation><documentation><url>https://github.com/soedinglab/MMseqs2/wiki/Tutorials</url><type>Training material</type><note>Tutorial material</note></documentation><relation><biotoolsID>mmseqs</biotoolsID><type>isNewVersionOf</type></relation><relation><biotoolsID>plass</biotoolsID><type>usedBy</type></relation><relation><biotoolsID>linclust</biotoolsID><type>includes</type></relation><relation><biotoolsID>metaeuk</biotoolsID><type>usedBy</type></relation><relation><biotoolsID>conterminator</biotoolsID><type>usedBy</type></relation><relation><biotoolsID>spacepharer</biotoolsID><type>usedBy</type></relation><publication><doi>10.1038/nbt.3988</doi><pmid>29035372</pmid></publication><publication><doi>10.1101/079681</doi></publication><publication><doi>10.1038/s41467-018-04964-5</doi><pmid>29959318</pmid><pmcid>PMC6026198</pmcid><type>Method</type></publication><publication><doi>10.1093/bioinformatics/bty1057</doi><pmid>30615063</pmid><pmcid>PMC6691333</pmcid><type>Method</type></publication><publication><doi>10.1101/2020.11.27.401018</doi></publication><publication><doi>10.1093/bioinformatics/btab184</doi><pmid>33734313</pmid><pmcid>PMC8479651</pmcid><note>MMseqs2 Taxonomy</note></publication><publication><doi>10.1101/2024.11.13.623350v1</doi><note>MMseqs2-GPU preprint</note></publication><publication><doi>10.1038/s41592-025-02819-8</doi><note>MMseqs2-GPU</note></publication></tool><tool><name>prolfquapp</name><description>A command-line tool for differential expression analysis in quantitative proteomics</description><homepage>https://github.com/prolfqua/prolfquapp</homepage><biotoolsID>prolfquapp</biotoolsID><biotoolsCURIE>biotools:prolfquapp</biotoolsCURIE><version>0.1.6</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0121</uri><term>Proteomics</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>R</language><license>MIT</license><maturity>Emerging</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3741</uri><term>Differential protein expression profiling</term></operation><input><data><uri>http://edamontology.org/data_2603</uri><term>Expression data</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></input><input><data><uri>http://edamontology.org/data_2044</uri><term>Sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_2603</uri><term>Expression data</term></data><format><uri>http://edamontology.org/format_3620</uri><term>xlsx</term></format><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output><cmd>./prolfqua_dea.sh -i data_dir/ -d annotation.xlsx -y config.yaml -w NameOfAnalysis -s DIANN
# and again you run the version within the docker container with
# ./prolfquapp_docker.sh prolfqua_dea.sh -i data_dir/ -d annotation.xlsx -y config.yaml -w NameOfAnalysis -s DIANN</cmd></function><function><operation><uri>http://edamontology.org/operation_2428</uri><term>Validation</term></operation><input><data><uri>http://edamontology.org/data_2603</uri><term>Expression data</term></data><format><uri>http://edamontology.org/format_3620</uri><term>xlsx</term></format></input><output><data><uri>http://edamontology.org/data_3914</uri><term>Quality control report</term></data><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output></function><function><operation><uri>http://edamontology.org/operation_0571</uri><term>Expression data visualisation</term></operation><input><data><uri>http://edamontology.org/data_2603</uri><term>Expression data</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></input><input><data><uri>http://edamontology.org/data_2976</uri><term>Protein sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_2603</uri><term>Expression data</term></data><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format><format><uri>http://edamontology.org/format_3508</uri><term>PDF</term></format></output></function><link><url>https://github.com/prolfqua/prolfquapp</url><type>Repository</type></link><link><url>https://github.com/prolfqua/prolfquapp/issues</url><type>Issue tracker</type></link><download><url>https://github.com/prolfqua/prolfquapp/releases/tag/0.1.6</url><type>Downloads page</type><version>0.1.6</version></download><documentation><url>https://github.com/prolfqua/prolfquapp/blob/master/README.md</url><type>General</type></documentation><publication><doi>10.1021/acs.jproteome.4c00911</doi><type>Primary</type><version>0.0.6</version></publication></tool><tool><name>PeptideShaker</name><description>PeptideShaker is a search engine independent platform for interpretation of proteomics identification results from multiple search engines, currently supporting X!Tandem, MS-GF+, MS Amanda, OMSSA, MyriMatch, Comet, Tide, Mascot, Andromeda and mzIdentML. By combining the results from multiple search engines, while re-calculating PTM localization scores and redoing the protein inference, PeptideShaker attempts to give you the best possible understanding of your proteomics data</description><homepage>http://compomics.github.io/projects/peptide-shaker.html</homepage><biotoolsID>peptideshaker</biotoolsID><biotoolsCURIE>biotools:peptideshaker</biotoolsCURIE><version>3.0.11</version><toolType>Desktop application</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0121</uri><term>Proteomics</term></topic><topic><uri>http://edamontology.org/topic_3520</uri><term>Proteomics experiment</term></topic><topic><uri>http://edamontology.org/topic_0644</uri><term>Proteome</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>Java</language><license>Apache-2.0</license><collectionID>de.NBI</collectionID><collectionID>Proteomics</collectionID><collectionID>BioInfra.Prot</collectionID><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3694</uri><term>Mass spectrum visualisation</term></operation><operation><uri>http://edamontology.org/operation_3631</uri><term>Peptide identification</term></operation><operation><uri>http://edamontology.org/operation_2428</uri><term>Validation</term></operation><operation><uri>http://edamontology.org/operation_3755</uri><term>PTM localisation</term></operation><operation><uri>http://edamontology.org/operation_3501</uri><term>Enrichment analysis</term></operation><operation><uri>http://edamontology.org/operation_0570</uri><term>Structure visualisation</term></operation><operation><uri>http://edamontology.org/operation_3767</uri><term>Protein identification</term></operation><input><data><uri>http://edamontology.org/data_2976</uri><term>Protein sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_0945</uri><term>Peptide identification</term></data><format><uri>http://edamontology.org/format_3247</uri><term>mzIdentML</term></format></input><input><data><uri>http://edamontology.org/data_2536</uri><term>Mass spectrometry data</term></data><format><uri>http://edamontology.org/format_3651</uri><term>MGF</term></format></input><output><data><uri>http://edamontology.org/data_2992</uri><term>Protein structure image</term></data><format><uri>http://edamontology.org/format_3579</uri><term>JPG</term></format><format><uri>http://edamontology.org/format_3604</uri><term>SVG</term></format></output><output><data><uri>http://edamontology.org/data_3754</uri><term>GO-term enrichment data</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output><output><data><uri>http://edamontology.org/data_2969</uri><term>Sequence image</term></data><format><uri>http://edamontology.org/format_3547</uri><term>Image format</term></format></output><output><data><uri>http://edamontology.org/data_0896</uri><term>Protein report</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output></function><download><url>https://github.com/compomics/peptide-shaker</url><type>Source code</type></download><documentation><url>http://compomics.github.io/projects/peptide-shaker.html#read-me</url><type>General</type></documentation><publication><doi>10.1038/nbt.3109</doi><pmid>25574629</pmid><type>Primary</type></publication><credit><name>BioInfra.Prot</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>Dominik Kopczynsk</name><email>dominik.kopczynski@isas.de</email><url>http://compomics.github.io/projects/peptide-shaker.html</url><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>NGP</name><description>Simulation of next-generation proteomics (NGP) based on single-molecule sequencing.</description><homepage>https://github.com/magnuspalmblad/NGP</homepage><biotoolsID>ngp</biotoolsID><biotoolsCURIE>biotools:ngp</biotoolsCURIE><function><operation><uri>http://edamontology.org/operation_2426</uri><term>Modelling and simulation</term></operation><input><data><uri>http://edamontology.org/data_2976</uri><term>Protein sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3869</uri><term>Simulation</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format><format><uri>http://edamontology.org/format_3508</uri><term>PDF</term></format></output></function><publication><doi>10.1021/acs.jproteome.1c00136</doi><pmid>33904308</pmid><pmcid>PMC8185883</pmcid><type>Primary</type></publication></tool><tool><name>NetStart</name><description>Prediction of eukaryotic translation initiation sites using a protein language model.

NetStart 2.0 is a deep learning-based model that predicts canonical translation initiation sites in mRNA transcripts in species across the eukaryotic domain, including vertebrates, invertebrates, plants, fungi, and protists. 

The model integrates the ESM-2 protein language model for assessing transitions from non-coding to coding regions with local start codon sequence context and taxonomical information. It has been trained on sequences from 60 phylogenetically diverse eukaryotic species.

If you need help with the bioinformatics programs, see the "Getting Help" section at the server site at the link.</description><homepage>https://services.healthtech.dtu.dk/services/NetStart-2.0/</homepage><biotoolsID>netstart_2.0</biotoolsID><biotoolsCURIE>biotools:netstart_2.0</biotoolsCURIE><version>2.0</version><topic><uri>http://edamontology.org/topic_3512</uri><term>Gene transcripts</term></topic><topic><uri>http://edamontology.org/topic_0108</uri><term>Protein expression</term></topic><topic><uri>http://edamontology.org/topic_0203</uri><term>Gene expression</term></topic><topic><uri>http://edamontology.org/topic_0621</uri><term>Model organisms</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>Python</language><function><operation><uri>http://edamontology.org/operation_3767</uri><term>Protein identification</term></operation><operation><uri>http://edamontology.org/operation_0362</uri><term>Genome annotation</term></operation><operation><uri>http://edamontology.org/operation_0477</uri><term>Protein modelling</term></operation><operation><uri>http://edamontology.org/operation_0439</uri><term>Translation initiation site prediction</term></operation><input><data><uri>http://edamontology.org/data_3495</uri><term>RNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_1772</uri><term>Score</term></data></output></function><link><url>https://github.com/lsandvad/netstart2</url><type>Repository</type><note>GitHub with code and instructions and to download and run the program locally</note></link><link><url>https://services.healthtech.dtu.dk/services/NetStart-2.0/</url><type>Service</type><note>NetStart 2.0 server where sequences can be uploaded directly without having to download the program (hosted by DTU Health Tech).</note></link><publication><doi>10.1186/S12859-025-06220-2</doi><pmid>40830753</pmid><pmcid>PMC12366053</pmcid><type>Method</type><version>2.0</version></publication><credit><name>Line Sandvad Nielsen</name><email>line.s.nielsen@bio.ku.dk</email><typeEntity>Person</typeEntity></credit><credit><name>Henrik Nielsen</name><email>henni@dtu.dk</email><typeEntity>Person</typeEntity></credit></tool><tool><name>MS Annika</name><description>MS Annika is a crosslink search engine based on MS Amanda, aimed at identifying crosslinks of cleavable and non-cleavable crosslinkers from MS2 and MS3 spectra. It can deal with a wide variety of cross-linkers and provides robust and transparent FDR control based on a target-decoy approach. MS Annika is available free of charge as a plug-in for Proteome Discoverer.</description><homepage>https://github.com/hgb-bin-proteomics/MSAnnika</homepage><biotoolsID>ms_annika</biotoolsID><biotoolsCURIE>biotools:ms_annika</biotoolsCURIE><version>Proteome Discoverer 3.2: MS Annika 3.0 v3.0.7</version><version>Proteome Discoverer 3.1: MS Annika 3.0 v3.0.7</version><version>Proteome Discoverer 3.0: MS Annika 2.0 v1.1.4fix</version><version>Proteome Discoverer 2.5: MS Annika 2.0 v1.1.3</version><version>Proteome Discoverer 2.4: MS Annika v1.0.0</version><version>Proteome Discoverer 2.3: MS Annika v1.0.0</version><toolType>Plug-in</toolType><topic><uri>http://edamontology.org/topic_0130</uri><term>Protein folding, stability and design</term></topic><topic><uri>http://edamontology.org/topic_0736</uri><term>Protein folds and structural domains</term></topic><topic><uri>http://edamontology.org/topic_0128</uri><term>Protein interactions</term></topic><topic><uri>http://edamontology.org/topic_0121</uri><term>Proteomics</term></topic><topic><uri>http://edamontology.org/topic_0091</uri><term>Bioinformatics</term></topic><operatingSystem>Windows</operatingSystem><language>C#</language><license>Freeware</license><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3631</uri><term>Peptide identification</term></operation><operation><uri>http://edamontology.org/operation_3646</uri><term>Peptide database search</term></operation><operation><uri>http://edamontology.org/operation_3755</uri><term>PTM localisation</term></operation><operation><uri>http://edamontology.org/operation_3645</uri><term>PTM identification</term></operation><operation><uri>http://edamontology.org/operation_3649</uri><term>Target-Decoy</term></operation><input><data><uri>http://edamontology.org/data_0943</uri><term>Mass spectrum</term></data><format><uri>http://edamontology.org/format_3651</uri><term>MGF</term></format><format><uri>http://edamontology.org/format_3712</uri><term>Thermo RAW</term></format></input><input><data><uri>http://edamontology.org/data_2886</uri><term>Protein sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0857</uri><term>Sequence search results</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output></function><link><url>https://github.com/hgb-bin-proteomics/MSAnnika</url><type>Repository</type><note>MS Annika version archive</note></link><download><url>https://github.com/hgb-bin-proteomics/MSAnnika/raw/master/releases/latest/PD3.2/latest.zip</url><type>Software package</type><note>Latest MS Annika version for Proteome Disoverer 3.2</note><version>PD3.2:latest</version></download><download><url>https://github.com/hgb-bin-proteomics/MSAnnika/raw/master/releases/latest/PD3.1/latest.zip</url><type>Software package</type><note>Latest MS Annika version for Proteome Disoverer 3.1</note><version>PD3.1:latest</version></download><download><url>https://github.com/hgb-bin-proteomics/MSAnnika/raw/master/releases/latest/PD3.0/latest.zip</url><type>Software package</type><note>Latest MS Annika version for Proteome Disoverer 3.0</note><version>PD3.0:latest</version></download><download><url>https://github.com/hgb-bin-proteomics/MSAnnika/raw/master/releases/latest/PD2.5/latest.zip</url><type>Software package</type><note>Latest MS Annika version for Proteome Disoverer 2.5</note><version>PD2.5:latest</version></download><documentation><url>https://hgb-bin-proteomics.github.io/MSAnnika/</url><type>General</type></documentation><documentation><url>https://github.com/hgb-bin-proteomics/MSAnnika/raw/master/MS_Annika_3.0_UserManual.pdf</url><type>User manual</type></documentation><documentation><url>https://github.com/hgb-bin-proteomics/MSAnnika/raw/master/tutorial/MS_Annika_3.0_Tutorial.pdf</url><type>Training material</type><note>Detailed tutorial how to use MS Annika</note></documentation><documentation><url>https://www.youtube.com/watch?v=L1lVt35PYv4</url><type>Training material</type><note>Detailed tutorial how to use MS Annika</note></documentation><relation><biotoolsID>ms_amanda</biotoolsID><type>uses</type></relation><publication><doi>10.1021/acs.jproteome.0c01000</doi><pmid>33852321</pmid><pmcid>PMC8155564</pmcid><type>Primary</type><version>1.0</version><note>MS Annika for cleavable crosslink identification from MS2 spectra</note></publication><publication><doi>10.1021/acs.jproteome.3c00325</doi><pmid>37566781</pmid><pmcid>PMC10476269</pmcid><type>Primary</type><version>2.0</version><note>MS Annika 2.0 for cleavable crosslink identification from MS3 spectra</note></publication><publication><doi>10.1038/s42004-024-01386-x</doi><pmid>39702463</pmid><pmcid>PMC11659399</pmcid><type>Primary</type><version>3.0</version><note>MS Annika 3.0 for non-cleavable crosslink identification</note></publication><credit><name>Viktoria Dorfer</name><email>viktoria.dorfer@fh-hagenberg.at</email><url>http://bioinformatics.fh-hagenberg.at/</url><orcidid>https://orcid.org/0000-0002-5332-5701</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit><credit><name>Micha Birklbauer</name><email>micha.birklbauer@fh-hagenberg.at</email><url>http://bioinformatics.fh-hagenberg.at/</url><orcidid>https://orcid.org/0009-0005-1051-179X</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>EPIK</name><description>Precise and scalable evolutionary placement with informative k-mers.</description><homepage>https://github.com/phylo42/EPIK</homepage><biotoolsID>epik</biotoolsID><biotoolsCURIE>biotools:epik</biotoolsCURIE><version>0.2.0</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0084</uri><term>Phylogeny</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><language>C++</language><language>Python</language><license>MIT</license><maturity>Emerging</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0323</uri><term>Phylogenetic inference</term></operation><input><data><uri>http://edamontology.org/data_0863</uri><term>Sequence alignment</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_2976</uri><term>Protein sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_0955</uri><term>Data index</term></data><format><uri>http://edamontology.org/format_2333</uri><term>Binary format</term></format></input><output><data><uri>http://edamontology.org/data_0872</uri><term>Phylogenetic tree</term></data><format><uri>http://edamontology.org/format_3464</uri><term>JSON</term></format></output></function><publication><doi>10.1093/BIOINFORMATICS/BTAD692</doi><pmid>37975872</pmid><pmcid>PMC10701097</pmcid><version>0.2.0</version></publication><credit><name>Nikolai Romashchenko</name><email>nromashchenko@lirmm.fr</email><orcidid>https://orcid.org/0000-0002-2264-5219</orcidid><typeEntity>Person</typeEntity></credit><credit><name>Eric Rivals</name><email>rivals@lirmm.fr</email><orcidid>https://orcid.org/0000-0003-3791-3973</orcidid><typeEntity>Person</typeEntity></credit></tool><tool><name>SQUARNA</name><description>SQUARNA is a tool for RNA secondary structure prediction. It can take a single RNA sequence or an alignment of sequences as input. SQUARNA handles pseudoknots and can predict alternative structures. SQUARNA allows structural restraints and chemical probing data as additional input and is available at https://github.com/febos/SQUARNA.</description><homepage>https://github.com/febos/SQUARNA</homepage><biotoolsID>squarna</biotoolsID><biotoolsCURIE>biotools:squarna</biotoolsCURIE><version>2.3</version><toolType>Command-line tool</toolType><toolType>Script</toolType><toolType>Library</toolType><topic><uri>http://edamontology.org/topic_3307</uri><term>Computational biology</term></topic><topic><uri>http://edamontology.org/topic_0077</uri><term>Nucleic acids</term></topic><topic><uri>http://edamontology.org/topic_0081</uri><term>Structure analysis</term></topic><topic><uri>http://edamontology.org/topic_0097</uri><term>Nucleic acid structure analysis</term></topic><topic><uri>http://edamontology.org/topic_0082</uri><term>Structure prediction</term></topic><topic><uri>http://edamontology.org/topic_0099</uri><term>RNA</term></topic><topic><uri>http://edamontology.org/topic_0659</uri><term>Functional, regulatory and non-coding RNA</term></topic><topic><uri>http://edamontology.org/topic_3511</uri><term>Nucleic acid sites, features and motifs</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><language>Python</language><license>Apache-2.0</license><collectionID>3D-BioInfo-Nucleic-Acid</collectionID><maturity>Emerging</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0278</uri><term>RNA secondary structure prediction</term></operation><input><data><uri>http://edamontology.org/data_3495</uri><term>RNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1961</uri><term>Stockholm format</term></format><format><uri>http://edamontology.org/format_1982</uri><term>ClustalW format</term></format></input><input><data><uri>http://edamontology.org/data_1383</uri><term>Nucleic acid sequence alignment</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1961</uri><term>Stockholm format</term></format><format><uri>http://edamontology.org/format_1982</uri><term>ClustalW format</term></format></input><input><data><uri>http://edamontology.org/data_0889</uri><term>Structural profile</term></data></input><output><data><uri>http://edamontology.org/data_0880</uri><term>RNA secondary structure</term></data><format><uri>http://edamontology.org/format_2546</uri><term>FASTA-like</term></format></output><output><data><uri>http://edamontology.org/data_1772</uri><term>Score</term></data></output><note>Input can be a single or several RNA sequences or an alignment of RNA sequences along with structural restraints and/or chemical probing data on base reactivities</note><cmd>SQUARNA i=inputfile [OPTIONS]</cmd></function><link><url>https://github.com/febos/SQUARNA</url><type>Repository</type><note>GitHub repository</note></link><link><url>https://pypi.org/project/SQUARNA/</url><type>Software catalogue</type><note>PyPi repository page</note></link><download><url>https://doi.org/10.5281/zenodo.8292325</url><type>Downloads page</type><version>2.3</version></download><download><url>https://pypi.org/project/SQUARNA/</url><type>Software package</type><note>pip install SQUARNA</note><version>2.3</version></download><documentation><url>https://github.com/febos/SQUARNA/blob/main/README.md</url><type>User manual</type></documentation><documentation><url>https://pypi.org/project/SQUARNA/</url><type>Installation instructions</type><note>pip install SQUARNA</note></documentation><publication><doi>10.1101/2023.08.28.555103</doi><note>biorxiv preprint</note></publication><credit><name>Eugene F Baulin</name><email>efbaulin@gmail.com</email><url>https://scholar.google.com/citations?user=sjfDaUwAAAAJ&amp;hl=en</url><orcidid>https://orcid.org/0000-0003-4694-9783</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole><typeRole>Primary contact</typeRole><typeRole>Support</typeRole><note>https://imol.institute/leaders/baulin-group/</note></credit></tool><tool><name>REPET</name><description>The REPET package integrates bioinformatics pipelines dedicated to detect, annotate and analyze transposable elements (TEs) in genomic sequences. The main pipelines are (i) TEdenovo, which search for interspersed repeats, build consensus sequences and classify them according to TE features, and (ii)
 TEannot, which mines a genome with a library of TE sequences, for instance the one produced by the TEdenovo pipeline, to provide TE annotations exported into GFF3 files.</description><homepage>https://urgi.versailles.inrae.fr/Tools/REPET</homepage><biotoolsID>repet</biotoolsID><biotoolsCURIE>biotools:repet</biotoolsCURIE><version>v3.0</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0157</uri><term>Sequence composition, complexity and repeats</term></topic><topic><uri>http://edamontology.org/topic_0654</uri><term>DNA</term></topic><topic><uri>http://edamontology.org/topic_0097</uri><term>Nucleic acid structure analysis</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><language>C++</language><language>Python</language><license>CECILL-2.0</license><collectionID>REPET</collectionID><collectionID>elixir-fr-sdp-2019</collectionID><collectionID>URGI</collectionID><maturity>Legacy</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Tools</elixirPlatform><elixirNode>France</elixirNode><function><operation><uri>http://edamontology.org/operation_0379</uri><term>Repeat sequence detection</term></operation><operation><uri>http://edamontology.org/operation_0237</uri><term>Repeat sequence analysis</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><output><data><uri>http://edamontology.org/data_1270</uri><term>Feature table</term></data><format><uri>http://edamontology.org/format_2206</uri><term>Sequence feature table format (text)</term></format></output><output><data><uri>http://edamontology.org/data_3002</uri><term>Annotation track</term></data><format><uri>http://edamontology.org/format_1939</uri><term>GFF3-seq</term></format></output></function><link><url>https://urgi.versailles.inrae.fr/download/repet/</url><type>Repository</type></link><download><url>https://urgi.versailles.inrae.fr/download/repet/REPET_linux-x64-3.0.tar.gz</url><type>Source code</type><version>v3.0</version></download><documentation><url>https://urgi.versailles.inrae.fr/Tools/REPET</url><type>General</type><note>see also https://urgi.versailles.inrae.fr/Tools/REPET/README</note></documentation><documentation><url>https://urgi.versailles.inrae.fr/Tools/REPET/INSTALL</url><type>Installation instructions</type></documentation><documentation><url>https://urgi.versailles.inrae.fr/Tools/REPET/TEdenovo-tuto</url><type>User manual</type><note>See also https://urgi.versailles.inra.fr/Tools/REPET/TEannot-tuto</note></documentation><documentation><url>https://forgemia.inra.fr/urgi-anagen/wiki-repet/-/wikis/REPET-V3.0-tutorial</url><type>Training material</type></documentation><publication><doi>10.1371/journal.pone.0091929</doi><pmid>24786468</pmid><pmcid>PMC4008368</pmcid><type>Primary</type></publication><publication><doi>10.1371/journal.pone.0016526</doi><pmid>21304975</pmid><pmcid>PMC3031573</pmcid><type>Primary</type></publication><publication><doi>10.1371/journal.pcbi.0010022</doi><pmid>16110336</pmid><pmcid>PMC1185648</pmcid><type>Primary</type></publication><publication><doi>10.1007/s00239-003-0007-2</doi><pmid>15008403</pmid><type>Method</type></publication><publication><doi>10.1109/JPROC.2016.2590833</doi><type>Method</type></publication><credit><name>URGI</name><email>urgi-repet@versailles.inrae.fr</email><url>http://urgi.versailles.inrae.fr/</url><typeEntity>Institute</typeEntity><typeRole>Developer</typeRole><typeRole>Contributor</typeRole><typeRole>Maintainer</typeRole><typeRole>Provider</typeRole><typeRole>Support</typeRole><typeRole>Documentor</typeRole><note>This tool has a "Numero de depot APP": FR 001 480007 000 R P 2008 000 31 235</note></credit></tool><tool><name>fqcleaner</name><description>Mini workflow for preprocessing fastq files, it includes quality trimming, duplicate and artefacts filters for FASTQ read files.</description><homepage>https://gensoft.pasteur.fr/docs/fqCleanER/21.10/</homepage><biotoolsID>fqcleaner</biotoolsID><biotoolsCURIE>biotools:fqcleaner</biotoolsCURIE><version>0.4.1.1</version><toolType>Web application</toolType><topic><uri>http://edamontology.org/topic_3168</uri><term>Sequencing</term></topic><topic><uri>http://edamontology.org/topic_3572</uri><term>Data quality management</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><collectionID>galaxyPasteur</collectionID><collectionID>Institut Pasteur</collectionID><collectionID>fqtools</collectionID><maturity>Mature</maturity><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3192</uri><term>Sequence trimming</term></operation><operation><uri>http://edamontology.org/operation_3695</uri><term>Data filtering</term></operation><input><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_0863</uri><term>Sequence alignment</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1932</uri><term>FASTQ-sanger</term></format></output><output><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1932</uri><term>FASTQ-sanger</term></format></output><output><data><uri>http://edamontology.org/data_0006</uri><term>Data</term></data><format><uri>http://edamontology.org/format_1932</uri><term>FASTQ-sanger</term></format></output></function><link><url>https://galaxy.pasteur.fr/tool_runner?tool_id=toolshed.pasteur.fr/repos/odoppelt/fqcleaner/fqCleaner/0.4.1.1</url><type>Galaxy service</type></link><publication><doi>10.1093/nar/gkw343</doi><type>Other</type></publication><publication><doi>10.7490/f1000research.1114334.1</doi><type>Other</type></publication><credit><name>Galaxy Support Team</name><email>galaxy@pasteur.fr</email><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>ETENLNC</name><description>ETENLNC (End-To-End-Novel-NonCoding) is a lncRNA identification and analysis framework</description><homepage>https://github.com/EvolOMICS-TU/ETENLNC</homepage><biotoolsID>etenlnc</biotoolsID><biotoolsCURIE>biotools:etenlnc</biotoolsCURIE><version>1.0</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0659</uri><term>Functional, regulatory and non-coding RNA</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Mac</operatingSystem><operatingSystem>Windows</operatingSystem><license>GPL-3.0</license><maturity>Emerging</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3219</uri><term>Read pre-processing</term></operation><operation><uri>http://edamontology.org/operation_3218</uri><term>Sequencing quality control</term></operation><operation><uri>http://edamontology.org/operation_3258</uri><term>Transcriptome assembly</term></operation><operation><uri>http://edamontology.org/operation_0415</uri><term>Nucleic acid feature detection</term></operation><operation><uri>http://edamontology.org/operation_3800</uri><term>RNA-Seq quantification</term></operation><operation><uri>http://edamontology.org/operation_3223</uri><term>Differential gene expression profiling</term></operation><operation><uri>http://edamontology.org/operation_0531</uri><term>Heat map generation</term></operation><operation><uri>http://edamontology.org/operation_0278</uri><term>RNA secondary structure prediction</term></operation><operation><uri>http://edamontology.org/operation_3472</uri><term>k-mer counting</term></operation><operation><uri>http://edamontology.org/operation_0389</uri><term>Protein-nucleic acid interaction analysis</term></operation><operation><uri>http://edamontology.org/operation_3901</uri><term>RNA-binding protein prediction</term></operation><operation><uri>http://edamontology.org/operation_2437</uri><term>Gene regulatory network prediction</term></operation><input><data><uri>http://edamontology.org/data_2603</uri><term>Expression data</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_1288</uri><term>Genome map</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_3002</uri><term>Annotation track</term></data><format><uri>http://edamontology.org/format_2306</uri><term>GTF</term></format><format><uri>http://edamontology.org/format_2305</uri><term>GFF</term></format></input><output><data><uri>http://edamontology.org/data_2887</uri><term>Nucleic acid sequence record</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><output><data><uri>http://edamontology.org/data_3112</uri><term>Gene expression matrix</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format></output><output><data><uri>http://edamontology.org/data_1636</uri><term>Heat map</term></data><format><uri>http://edamontology.org/format_3508</uri><term>PDF</term></format></output><output><data><uri>http://edamontology.org/data_0906</uri><term>Protein interaction data</term></data></output><note>Run ETENLNC on paired-end raw RNA-Seq (.fastq) data</note><cmd>bash ETENLNC_docker.sh</cmd></function><link><url>https://github.com/EvolOMICS-TU/ETENLNC</url><type>Repository</type><note>GitHub Repo</note></link><download><url>https://zenodo.org/records/14325721?token=eyJhbGciOiJIUzUxMiJ9.eyJpZCI6ImVmYTllNWJkLWE4ODUtNGM3OC05YTAxLWM4ZDk5YTljZDZjMCIsImRhdGEiOnt9LCJyYW5kb20iOiI4M2I1YjBlZDQ4MmUyZTIxNDg2YmI0YTFkMWE5MTI1OCJ9.HeB3WrsPduNzMyXjH4x5HfCgmIp4NzYv0P_11XU9lcXC_ZxEaVndP-kD0LDkxwufHVNlroeAhdK33PR51F6jnw</url><type>Biological data</type><note>Sample/demo data for ETENLNC. A detailed guide on running ETENLNC using the demo data can be found in the ETENLNC manual (supplementary to our publication).</note><version>1.0</version></download><documentation><url>https://github.com/EvolOMICS-TU/ETENLNC</url><type>Command-line options</type><note>A detailed guide on running ETENLNC using the demo data can be found in the ETENLNC manual (supplementary to our publication)</note></documentation><relation><biotoolsID>hisat2</biotoolsID><type>uses</type></relation><relation><biotoolsID>salmon</biotoolsID><type>uses</type></relation><relation><biotoolsID>fastqc</biotoolsID><type>uses</type></relation><relation><biotoolsID>fastp</biotoolsID><type>uses</type></relation><relation><biotoolsID>stringtie</biotoolsID><type>uses</type></relation><relation><biotoolsID>gffcompare</biotoolsID><type>uses</type></relation><relation><biotoolsID>cpc2</biotoolsID><type>uses</type></relation><relation><biotoolsID>ncbi_blast_plus</biotoolsID><type>uses</type></relation><relation><biotoolsID>tximport</biotoolsID><type>uses</type></relation><relation><biotoolsID>deseq2</biotoolsID><type>uses</type></relation><relation><biotoolsID>lnctar</biotoolsID><type>uses</type></relation><relation><biotoolsID>capsule-lpi</biotoolsID><type>uses</type></relation><relation><biotoolsID>seekr</biotoolsID><type>uses</type></relation><publication><doi>10.1016/j.compbiolchem.2024.108140</doi><pmid>38996755</pmid><type>Primary</type><version>.0</version></publication><credit><name>Pankaj Barah</name><email>barah@tezu.ernet.in</email><url>https://www.tezu.ernet.in/dmbbt/profile/34</url><orcidid>https://orcid.org/0000-0001-7039-7996</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Maintainer</typeRole><note>Assistant Professor at Department of Molecular Biology and Biotechnology, Tezpur University.</note></credit><credit><name>Prangan Nath</name><email>prangannathofficial@gmai.com</email><orcidid>https://orcid.org/0000-0002-9451-7822</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole><typeRole>Maintainer</typeRole></credit></tool><tool><name>UniversalMer</name><description>UniversalMer is a k-mer counting tool for multiple size of k at once. It is available for DNA, RNA, and protein sequences. The program counts and summarizes the exact frequency of all k-mers from 1-mer to a user-defined maximum length (kmax). Analyzing the k-mer spectrum across multiple values of k can be done in seconds. This program is designed for bioinformatics researchers and scientists.</description><homepage>https://ugrammer.github.io</homepage><biotoolsID>UniversalMer</biotoolsID><biotoolsCURIE>biotools:UniversalMer</biotoolsCURIE><version>2.0.0</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0091</uri><term>Bioinformatics</term></topic><topic><uri>http://edamontology.org/topic_3473</uri><term>Data mining</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><language>C++</language><language>C</language><license>Freeware</license><maturity>Emerging</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Tools</elixirPlatform><elixirNode>UK</elixirNode><function><operation><uri>http://edamontology.org/operation_3472</uri><term>k-mer counting</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_2546</uri><term>FASTA-like</term></format></input><input><data><uri>http://edamontology.org/data_3495</uri><term>RNA sequence</term></data><format><uri>http://edamontology.org/format_2546</uri><term>FASTA-like</term></format></input><input><data><uri>http://edamontology.org/data_2976</uri><term>Protein sequence</term></data><format><uri>http://edamontology.org/format_2546</uri><term>FASTA-like</term></format></input><output><data><uri>http://edamontology.org/data_1266</uri><term>Base word frequencies table</term></data></output><output><data><uri>http://edamontology.org/data_1265</uri><term>Base frequencies table</term></data></output><output><data><uri>http://edamontology.org/data_3671</uri><term>Text</term></data></output></function><link><url>https://sourceforge.net/projects/universal-mer/</url><type>Mirror</type></link><download><url>https://sourceforge.net/projects/universal-mer/</url><type>Command-line specification</type><version>2.0</version></download></tool><tool><name>MirGeneDB</name><description>MirGeneDB is a database of manually curated microRNA genes that have been validated and annotated as initially described in Fromm et al. 2015 , Fromm et al. 2020 and Fromm et al 2022. MirGeneDB 3.0 (Clarke and Hoye et al. 2024 ) includes more than 21,000 microRNA gene entries representing more than 1,700 microRNA families from 114 metazoan species. All microRNAs can be browsed, searched and downloaded.</description><homepage>http://mirgenedb.org/</homepage><biotoolsID>mirgen</biotoolsID><biotoolsCURIE>biotools:mirgen</biotoolsCURIE><version>3.0</version><otherID><value>doi:10.25504/FAIRsharing.QXSgvF</value><type>doi</type><version>2.0</version></otherID><toolType>Database portal</toolType><topic><uri>http://edamontology.org/topic_0659</uri><term>Functional, regulatory and non-coding RNA</term></topic><topic><uri>http://edamontology.org/topic_0204</uri><term>Gene regulation</term></topic><topic><uri>http://edamontology.org/topic_3299</uri><term>Evolutionary biology</term></topic><topic><uri>http://edamontology.org/topic_3500</uri><term>Zoology</term></topic><topic><uri>http://edamontology.org/topic_2815</uri><term>Human biology</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><license>CC0-1.0</license><collectionID>UiO tools</collectionID><collectionID>ELIXIR-NO</collectionID><collectionID>ELIXIR-Norway</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirNode>Norway</elixirNode><function><operation><uri>http://edamontology.org/operation_2422</uri><term>Data retrieval</term></operation><operation><uri>http://edamontology.org/operation_0564</uri><term>Sequence visualisation</term></operation><input><data><uri>http://edamontology.org/data_1097</uri><term>Sequence accession (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_1869</uri><term>Organism identifier</term></data></input><output><data><uri>http://edamontology.org/data_3134</uri><term>Gene transcript report</term></data><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output><output><data><uri>http://edamontology.org/data_0880</uri><term>RNA secondary structure</term></data><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output><note>Data retrieval: curated miRNA. Organism identifier: a specific miRNA identifier or a species  for all miRNAs for that species. Gene transcript report: with metadata and visualization. RNA secondary structure: the hairpin loop of the miRNA with bases.</note></function><function><operation><uri>http://edamontology.org/operation_0224</uri><term>Query and retrieval</term></operation><input><data><uri>http://edamontology.org/data_1097</uri><term>Sequence accession (nucleic acid)</term></data></input><output><data><uri>http://edamontology.org/data_3495</uri><term>RNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><output><data><uri>http://edamontology.org/data_2012</uri><term>Sequence coordinates</term></data><format><uri>http://edamontology.org/format_2305</uri><term>GFF</term></format><format><uri>http://edamontology.org/format_3003</uri><term>BED</term></format></output><output><data><uri>http://edamontology.org/data_3917</uri><term>Count matrix</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output></function><link><url>https://elixir.no/helpdesk</url><type>Helpdesk</type></link><download><url>https://www.mirgenedb.org/download</url><type>Biological data</type><note>Sequence downloads for 75 species</note><version>3.0</version></download><documentation><url>https://www.mirgenedb.org/information</url><type>General</type></documentation><publication><doi>10.1093/nar/gkab1101</doi><pmid>34850127</pmid><pmcid>PMC8728216</pmcid><type>Primary</type><version>2.1</version></publication><publication><doi>10.1093/nar/gkz885</doi><pmid>31598695</pmid><pmcid>PMC6943042</pmcid><type>Primary</type><version>2.0</version></publication><publication><doi>10.1146/annurev-genet-120213-092023</doi><pmid>26473382</pmid><pmcid>PMC4743252</pmcid><type>Primary</type><version>1.0</version></publication><publication><doi>10.1093/nar/gkae1094</doi><pmid>39673268</pmid><pmcid>PMC11701709</pmcid><type>Primary</type><version>3.0</version></publication><credit><name>Bastian Fromm</name><email>BastianFromm@gmail.com</email><orcidid>https://orcid.org/0000-0003-0352-3037</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole><typeRole>Developer</typeRole><typeRole>Maintainer</typeRole><typeRole>Support</typeRole></credit><credit><name>Kevin J. Peterson</name><email>kevin.j.peterson@dartmouth.edu</email><typeEntity>Person</typeEntity><typeRole>Developer</typeRole><typeRole>Maintainer</typeRole></credit><credit><name>The Norwegian Bioinformatics Platform (ELIXIR-Norway) Helpdesk</name><email>support@elixir.no</email><url>https://elixir.no/helpdesk</url><typeEntity>Consortium</typeEntity><typeRole>Support</typeRole></credit><credit><name>University of Oslo</name><url>https://www.uio.no/english/index.html</url><gridid>grid.5510.1</gridid><rorid>01xtthb56</rorid><fundrefid>10.13039/501100005366</fundrefid><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit></tool><tool><name>RepeatModeler2</name><description>RepeatModeler is a de novo transposable element (TE) family identification and modeling package. At the heart of RepeatModeler are three de-novo repeat finding programs ( RECON, RepeatScout and LtrHarvest/Ltr_retriever ) which employ complementary computational methods for identifying repeat element boundaries and family relationships from sequence data.</description><homepage>https://github.com/Dfam-consortium/RepeatModeler</homepage><biotoolsID>RepeatModeler2</biotoolsID><biotoolsCURIE>biotools:RepeatModeler2</biotoolsCURIE><version>2.0.7</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0621</uri><term>Model organisms</term></topic><topic><uri>http://edamontology.org/topic_0157</uri><term>Sequence composition, complexity and repeats</term></topic><topic><uri>http://edamontology.org/topic_3673</uri><term>Whole genome sequencing</term></topic><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><topic><uri>http://edamontology.org/topic_0798</uri><term>Mobile genetic elements</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>Perl</language><license>OSL-2.0</license><maturity>Mature</maturity><cost>Free of charge</cost><function><operation><uri>http://edamontology.org/operation_0525</uri><term>Genome assembly</term></operation><operation><uri>http://edamontology.org/operation_3644</uri><term>de Novo sequencing</term></operation><operation><uri>http://edamontology.org/operation_0362</uri><term>Genome annotation</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input></function><link><url>https://github.com/Dfam-consortium/RepeatModeler/blob/master/LICENSE</url><type>Repository</type></link><relation><biotoolsID>repeatmodeler</biotoolsID><type>isNewVersionOf</type></relation><publication><doi>10.1101/856591</doi></publication></tool><tool><name>purge_dups</name><description>Identifying and removing haplotypic duplication in primary genome assemblies | haplotypic duplication identification tool | scripts/pd_config.py: script to generate a configuration file used by run_purge_dups.py | purge haplotigs and overlaps in an assembly based on read depth | Given a primary assembly pri_asm and an alternative assembly hap_asm (optional, if you have one), follow the steps shown below to build your own purge_dups pipeline, steps with same number can be run simultaneously. Among all the steps, although step 4 is optional, we highly recommend our users to do so, because assemblers may produce overrepresented seqeuences. In such a case, The final step 4 can be applied to remove those seqeuences</description><homepage>https://github.com/dfguan/purge_dups</homepage><biotoolsID>purge_dups</biotoolsID><biotoolsCURIE>biotools:purge_dups</biotoolsCURIE><version>v.1.2.6</version><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>Python</language><language>C</language><license>MIT</license><collectionID>ONTeater</collectionID><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0525</uri><term>Genome assembly</term></operation><operation><uri>http://edamontology.org/operation_3798</uri><term>Read binning</term></operation><operation><uri>http://edamontology.org/operation_3216</uri><term>Scaffolding</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function><link><url>https://github.com/dfguan/purge_dups</url><type>Repository</type></link><link><url>https://github.com/dfguan/purge_dups/issues</url><type>Issue tracker</type></link><publication><doi>10.1101/729962</doi></publication><credit><name>Dengfeng Guan</name><url>https://www.chatlink.com.cn</url><orcidid>https://orcid.org/0000-0002-6376-3940</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>Compleasm</name><description>Compleasm: a faster and more accurate reimplementation of BUSCO.
It provides measures for quantitative assessment of genome assembly, gene set, and transcriptome completeness based on evolutionarily informed expectations of gene content from near-universal single-copy orthologs.</description><homepage>https://github.com/huangnengCSU/compleasm</homepage><biotoolsID>compleasm</biotoolsID><biotoolsCURIE>biotools:compleasm</biotoolsCURIE><version>v.0.2.5</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><topic><uri>http://edamontology.org/topic_3308</uri><term>Transcriptomics</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><language>Python</language><license>Apache-2.0</license><collectionID>ONTeater</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_3180</uri><term>Sequence assembly validation</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_2546</uri><term>FASTA-like</term></format></input><output><data><uri>http://edamontology.org/data_2955</uri><term>Sequence report</term></data></output><note>Runs compleasm using the BUSCO set corresponding to the lineage given.</note><cmd>compleasm run -l "$lineage" -a assembly.fa -o output_prefix</cmd></function><link><url>https://github.com/huangnengCSU/compleasm</url><type>Repository</type></link><link><url>https://github.com/huangnengCSU/compleasm/issues</url><type>Issue tracker</type></link><link><url>https://busco.ezlab.org/list_of_lineages.html</url><type>Other</type><note>List of accepted lineages (taxonomic groups with curated BUSCO sets)</note></link><documentation><url>https://github.com/huangnengCSU/compleasm/blob/0.2.6/README.md</url><type>General</type></documentation><relation><biotoolsID>busco</biotoolsID><type>isNewVersionOf</type></relation><publication><doi>10.1093/bioinformatics/btad595</doi></publication><credit><name>Neng Huang</name><email>neng@ds.dfci.harvard.edu</email><orcidid>https://orcid.org/0000-0001-7187-0749</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>SILVA rRNA database</name><description>SILVA provides comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukaryota).</description><homepage>https://www.arb-silva.de</homepage><biotoolsID>silva</biotoolsID><biotoolsCURIE>biotools:silva</biotoolsCURIE><otherID><value>RRID:SCR_006423</value><type>rrid</type></otherID><toolType>Web application</toolType><toolType>Database portal</toolType><topic><uri>http://edamontology.org/topic_0659</uri><term>Functional, regulatory and non-coding RNA</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><topic><uri>http://edamontology.org/topic_3293</uri><term>Phylogenetics</term></topic><topic><uri>http://edamontology.org/topic_0637</uri><term>Taxonomy</term></topic><topic><uri>http://edamontology.org/topic_3050</uri><term>Biodiversity</term></topic><topic><uri>http://edamontology.org/topic_3301</uri><term>Microbiology</term></topic><topic><uri>http://edamontology.org/topic_0632</uri><term>Probes and primers</term></topic><license>CC-BY-4.0</license><collectionID>de.NBI</collectionID><collectionID>de.NBI-biodata</collectionID><collectionID>DSMZ Digital Diversity</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirPlatform>Data</elixirPlatform><elixirNode>Germany</elixirNode><function><operation><uri>http://edamontology.org/operation_0224</uri><term>Query and retrieval</term></operation><input><data><uri>http://edamontology.org/data_1097</uri><term>Sequence accession (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1964</uri><term>plain text format (unformatted)</term></format></input><input><data><uri>http://edamontology.org/data_1868</uri><term>Taxon</term></data><format><uri>http://edamontology.org/format_1964</uri><term>plain text format (unformatted)</term></format></input><input><data><uri>http://edamontology.org/data_1046</uri><term>Strain name</term></data><format><uri>http://edamontology.org/format_1964</uri><term>plain text format (unformatted)</term></format></input><input><data><uri>http://edamontology.org/data_1088</uri><term>Article ID</term></data><format><uri>http://edamontology.org/format_1964</uri><term>plain text format (unformatted)</term></format></input><input><data><uri>http://edamontology.org/data_2909</uri><term>Organism name</term></data><format><uri>http://edamontology.org/format_1964</uri><term>plain text format (unformatted)</term></format></input><output><data><uri>http://edamontology.org/data_2955</uri><term>Sequence report</term></data><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output><output><data><uri>http://edamontology.org/data_1383</uri><term>Sequence alignment (nucleic acid)</term></data><format><uri>http://edamontology.org/format_3830</uri><term>ARB</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><output><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_3830</uri><term>ARB</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><note>The search and retrieval functions of the SILVA website can be used to build custom subsets of sequences. In addition to simple searches e.g. for accession numbers, organism names, taxonomic entities, or publication DOI/PubMed IDs, complex queries over several database fields using constraints such as sequence length or quality values are possible. The results can be sorted according to accession numbers, organism names, sequence length, sequence and alignment quality and Pintail values. The search results show accession number, organism name, sequence length, sequence quality values, taxonomic classifications, and links to view the full sequence record on SILVA and on ENA. Sequences found via search and added to download cart can be downloaded as FASTA and ARB files.</note></function><function><operation><uri>http://edamontology.org/operation_2419</uri><term>Primer and probe design</term></operation><input><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_1207</uri><term>nucleotide</term></format></input><input><data><uri>http://edamontology.org/data_1240</uri><term>PCR primers</term></data><format><uri>http://edamontology.org/format_1207</uri><term>nucleotide</term></format></input><output><data><uri>http://edamontology.org/data_2048</uri><term>Report</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output><output><data><uri>http://edamontology.org/data_0850</uri><term>Sequence set</term></data><format><uri>http://edamontology.org/format_2333</uri><term>Binary format</term></format></output><output><data><uri>http://edamontology.org/data_1383</uri><term>Nucleic acid sequence alignment</term></data><format><uri>http://edamontology.org/format_3830</uri><term>ARB</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><output><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_3830</uri><term>ARB</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><note>The SILVA Probe Match and Evaluation Tool detects and displays all occurrences of a given probe or primer sequence in the SILVA datasets. 

TestPrime allows you to evaluate the performance of primer pairs by running an in silico PCR on the SILVA databases. From the results of the PCR, TestPrime computes coverages for each taxonomic group in all of the taxonomies offered by SILVA.</note></function><function><operation><uri>http://edamontology.org/operation_0492</uri><term>Multiple sequence alignment</term></operation><operation><uri>http://edamontology.org/operation_0496</uri><term>Global alignment</term></operation><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><operation><uri>http://edamontology.org/operation_0547</uri><term>Phylogenetic inference (maximum likelihood and Bayesian methods)</term></operation><input><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></input><output><data><uri>http://edamontology.org/data_1383</uri><term>Nucleic acid sequence alignment</term></data><format><uri>http://edamontology.org/format_3830</uri><term>ARB</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><output><data><uri>http://edamontology.org/data_0867</uri><term>Sequence alignment report</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></output><output><data><uri>http://edamontology.org/data_0872</uri><term>Phylogenetic tree</term></data><format><uri>http://edamontology.org/format_1910</uri><term>newick</term></format></output><output><data><uri>http://edamontology.org/data_1872</uri><term>Taxonomic classification</term></data><format><uri>http://edamontology.org/format_3752</uri><term>CSV</term></format></output><note>The Alignment, Classification and Tree Service (ACT) of SILVA allows to align and optionally classify sequences using the SILVA Incremental Aligner (SINA). From aligned sequences, (optionally) phylogenetic trees can be inferred using FastTree or RAxML.</note></function><link><url>https://www.arb-silva.de/browser/</url><type>Service</type><note>SILVA Taxonomy Browser</note></link><link><url>https://www.arb-silva.de/search/</url><type>Service</type><note>SILVA metadata search</note></link><link><url>https://www.arb-silva.de/aligner/</url><type>Service</type><note>ACT: Alignment, Classification and Tree Service</note></link><link><url>https://www.arb-silva.de/search/testprobe/</url><type>Service</type><note>SILVA Probe Match and Evaluation Tool</note></link><link><url>https://www.arb-silva.de/search/testprime/</url><type>Service</type><note>SILVA Primer Evaluation Tool</note></link><link><url>https://treeviewer.arb-silva.de/</url><type>Service</type><note>Web-based viewer for the SILVA guide trees</note></link><download><url>https://www.arb-silva.de/download/archive/</url><type>Downloads page</type><note>SILVA dataset archive</note></download><documentation><url>https://www.arb-silva.de/silva-license-information/</url><type>Terms of use</type></documentation><documentation><url>https://www.arb-silva.de/contact/</url><type>Citation instructions</type></documentation><documentation><url>https://www.arb-silva.de/documentation/</url><type>General</type></documentation><documentation><url>https://www.arb-silva.de/documentation/faqs/</url><type>FAQ</type></documentation><relation><biotoolsID>silvangs</biotoolsID><type>usedBy</type></relation><relation><biotoolsID>d3hub</biotoolsID><type>includedIn</type></relation><publication><doi>10.1093/nar/gks1219</doi><pmid>23193283</pmid><pmcid>PMC3531112</pmcid><type>Primary</type></publication><publication><doi>10.1093/nar/gkt1209</doi><pmid>24293649</pmid><pmcid>PMC3965112</pmcid><type>Other</type></publication><publication><doi>10.1093/nar/gkm864</doi><pmid>17947321</pmid><pmcid>PMC2175337</pmcid><type>Other</type></publication><publication><doi>10.1093/bioinformatics/bts252</doi><pmid>22556368</pmid><pmcid>PMC3389763</pmcid><type>Other</type></publication><publication><doi>10.1016/j.jbiotec.2017.06.1198</doi><pmid>28648396</pmid><type>Review</type></publication><publication><doi>10.1186/s12859-017-1841-3</doi><type>Other</type></publication><credit><name>Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures</name><email>hub@dsmz.de</email><url>https://www.dsmz.de</url><gridid>grid.420081.f</gridid><rorid>02tyer376</rorid><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>SILVA Team</name><email>contact@arb-silva.de</email><url>https://www.arb-silva.de/contact/team/</url><gridid>grid.507782.f</gridid><rorid>027z9pz32</rorid><typeEntity>Division</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>SILVAngs</name><description>SILVAngs is a data analysis service for ribosomal RNA gene (rDNA) amplicon reads from high-throughput sequencing (next-generation sequencing (NGS)) approaches based on an automatic software pipeline. It uses the SILVA rDNA databases, taxonomies, and alignments as a reference. It facilitates the classification of rDNA reads and provides a wealth of results (tables, graphs and sequence files) for download.</description><homepage>https://ngs.arb-silva.de</homepage><biotoolsID>silvangs</biotoolsID><biotoolsCURIE>biotools:silvangs</biotoolsCURIE><version>1.9.10</version><toolType>Web application</toolType><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><topic><uri>http://edamontology.org/topic_3050</uri><term>Biodiversity</term></topic><topic><uri>http://edamontology.org/topic_0637</uri><term>Taxonomy</term></topic><topic><uri>http://edamontology.org/topic_0659</uri><term>Functional, regulatory and non-coding RNA</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><topic><uri>http://edamontology.org/topic_3697</uri><term>Microbial ecology</term></topic><collectionID>de.NBI</collectionID><collectionID>de.NBI-biodata</collectionID><collectionID>DSMZ Digital Diversity</collectionID><maturity>Mature</maturity><cost>Free of charge (with restrictions)</cost><elixirPlatform>Data</elixirPlatform><elixirNode>Germany</elixirNode><function><operation><uri>http://edamontology.org/operation_0337</uri><term>Visualisation</term></operation><operation><uri>http://edamontology.org/operation_2238</uri><term>Statistical calculation</term></operation><operation><uri>http://edamontology.org/operation_2478</uri><term>Nucleic acid sequence analysis</term></operation><operation><uri>http://edamontology.org/operation_3460</uri><term>Taxonomic classification</term></operation><operation><uri>http://edamontology.org/operation_2428</uri><term>Validation</term></operation><operation><uri>http://edamontology.org/operation_0291</uri><term>Sequence clustering</term></operation><operation><uri>http://edamontology.org/operation_0292</uri><term>Sequence alignment</term></operation><input><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_2082</uri><term>Matrix</term></data><format><uri>http://edamontology.org/format_3475</uri><term>TSV</term></format></output><output><data><uri>http://edamontology.org/data_2884</uri><term>Plot</term></data></output><output><data><uri>http://edamontology.org/data_2048</uri><term>Report</term></data><format><uri>http://edamontology.org/format_3508</uri><term>PDF</term></format></output><output><data><uri>http://edamontology.org/data_1246</uri><term>Sequence cluster (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><output><data><uri>http://edamontology.org/data_2977</uri><term>Nucleic acid sequence</term></data><format><uri>http://edamontology.org/format_3830</uri><term>ARB</term></format><format><uri>http://edamontology.org/format_1984</uri><term>FASTA-aln</term></format></output><note>The pipeline accepts input data in Multi-Fasta format with each input file representing one sample. Samples that belong to one project (a transect, timeseries etc.) should be uploaded as a single SILVAngs project.</note></function><documentation><url>https://www.arb-silva.de/fileadmin/silva_databases/sngs/SILVAngs_User_Guide.pdf</url><type>User manual</type></documentation><documentation><url>https://www.arb-silva.de/documentation/silvangs/userfaq/</url><type>FAQ</type></documentation><documentation><url>https://www.arb-silva.de/footer/sngs-termsofuse</url><type>Terms of use</type></documentation><relation><biotoolsID>silva</biotoolsID><type>uses</type></relation><publication><doi>10.1093/nar/gks1219</doi><pmid>23193283</pmid><pmcid>PMC3531112</pmcid><type>Primary</type></publication><publication><doi>10.1093/nar/gkt1209</doi><pmid>24293649</pmid><pmcid>PMC3965112</pmcid><type>Primary</type></publication><publication><doi>10.1093/bioinformatics/bts252</doi><pmid>22556368</pmid><pmcid>PMC3389763</pmcid><type>Primary</type></publication><credit><name>SILVA Team</name><email>ngs-contact@arb-silva.de</email><url>https://www.arb-silva.de/contact/team/</url><gridid>grid.507782.f</gridid><rorid>027z9pz32</rorid><typeEntity>Division</typeEntity><typeRole>Primary contact</typeRole></credit><credit><name>Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures</name><email>hub@dsmz.de</email><url>https://www.dsmz.de</url><gridid>grid.420081.f</gridid><rorid>02tyer376</rorid><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit></tool><tool><name>DeepSig</name><description>Prediction of secretory signal peptides in protein sequences</description><homepage>https://busca.biocomp.unibo.it/deepsig/</homepage><biotoolsID>deepsig</biotoolsID><biotoolsCURIE>biotools:deepsig</biotoolsCURIE><version>1.2.5</version><toolType>Web application</toolType><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3307</uri><term>Computational biology</term></topic><topic><uri>http://edamontology.org/topic_3510</uri><term>Protein sites, features and motifs</term></topic><topic><uri>http://edamontology.org/topic_0123</uri><term>Protein properties</term></topic><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><operatingSystem>Mac</operatingSystem><language>Python</language><language>C++</language><license>GPL-3.0</license><collectionID>Bologna Biocomputing Group</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><elixirNode>Italy</elixirNode><function><operation><uri>http://edamontology.org/operation_0418</uri><term>Protein signal peptide detection</term></operation><input><data><uri>http://edamontology.org/data_2974</uri><term>Protein sequence (raw)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><input><data><uri>http://edamontology.org/data_3028</uri><term>Taxonomy</term></data><format><uri>http://edamontology.org/format_2330</uri><term>Textual format</term></format></input><output><data><uri>http://edamontology.org/data_0896</uri><term>Protein report</term></data><format><uri>http://edamontology.org/format_2331</uri><term>HTML</term></format></output></function><download><url>https://github.com/BolognaBiocomp/deepsig</url><type>Source code</type><version>1.2.5</version></download><download><url>https://hub.docker.com/r/bolognabiocomp/deepsig</url><type>Container file</type><version>1.2.5</version></download><documentation><url>https://github.com/BolognaBiocomp/deepsig</url><type>Command-line options</type></documentation><publication><doi>10.1093/bioinformatics/btx818</doi><pmid>29280997</pmid><pmcid>PMC5946842</pmcid><type>Primary</type><version>1.0</version></publication><credit><name>ELIXIR-ITA-BOLOGNA</name><url>http://biocomp.unibo.it</url><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><name>Castrense Savojardo</name><email>castrense.savojardo2@unibo.it</email><orcidid>https://orcid.org/0000-0002-7359-0633</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole><typeRole>Primary contact</typeRole></credit><credit><name>Pier Luigi Martelli</name><email>pierluigi.martelli@unibo.it</email><url>http://biocomp.unibo.it</url><orcidid>https://orcid.org/0000-0002-0274-5669</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>NanoPlot</name><description>NanoPlot is a tool with various visualizations of sequencing data in bam, cram, fastq, fasta or platform-specific TSV summaries, mainly intended for long-read sequencing from Oxford Nanopore Technologies and Pacific Biosciences</description><homepage>https://github.com/wdecoster/NanoPlot</homepage><biotoolsID>nanoplot</biotoolsID><biotoolsCURIE>biotools:nanoplot</biotoolsCURIE><version>v.1.42.0</version><toolType>Command-line tool</toolType><toolType>Web application</toolType><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><operatingSystem>Windows</operatingSystem><language>Python</language><license>GPL-3.0</license><collectionID>ONTeater</collectionID><maturity>Mature</maturity><cost>Free of charge (with restrictions)</cost><accessibility>Open access (with restrictions)</accessibility><function><operation><uri>http://edamontology.org/operation_2940</uri><term>Scatter plot plotting</term></operation><operation><uri>http://edamontology.org/operation_2943</uri><term>Box-Whisker plot plotting</term></operation><input><data><uri>http://edamontology.org/data_3494</uri><term>DNA sequence</term></data><format><uri>http://edamontology.org/format_2546</uri><term>FASTA-like</term></format><format><uri>http://edamontology.org/format_1207</uri><term>nucleotide</term></format></input></function><link><url>https://github.com/wdecoster/NanoPlot</url><type>Repository</type><note>Issue tracker and most up to date software version</note></link><link><url>http://nanoplot.bioinf.be/</url><type>Service</type><note>Web service with more limited options compared to the command line tool</note></link><download><url>https://anaconda.org/bioconda/nanoplot</url><type>Command-line specification</type></download><download><url>https://pypi.org/project/NanoPlot/</url><type>Command-line specification</type></download><documentation><url>https://github.com/wdecoster/NanoPlot</url><type>Command-line options</type></documentation><relation><biotoolsID>nanopack</biotoolsID><type>includedIn</type></relation><publication><doi>10.1093/bioinformatics/bty149</doi><pmid>29547981</pmid><pmcid>PMC6061794</pmcid><type>Method</type></publication><credit><name>Wouter De Coster</name><url>https://gigabaseorgigabyte.wordpress.com/</url><orcidid>https://orcid.org/0000-0002-5248-8197</orcidid><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>NextDenovo</name><description>NextDenovo is a string graph-based de novo assembler for long reads (CLR, HiFi and ONT). It uses a "correct-then-assemble" strategy similar to canu (no correction step for PacBio Hifi reads), but requires significantly less computing resources and storages.</description><homepage>https://github.com/Nextomics/NextDenovo</homepage><biotoolsID>nextdenovo</biotoolsID><biotoolsCURIE>biotools:nextdenovo</biotoolsCURIE><version>v.2.5.2</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3168</uri><term>Sequencing</term></topic><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><language>Python</language><language>C</language><license>GPL-3.0</license><collectionID>ONTeater</collectionID><maturity>Mature</maturity><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0524</uri><term>De-novo assembly</term></operation><operation><uri>http://edamontology.org/operation_0525</uri><term>Genome assembly</term></operation><input><data><uri>http://edamontology.org/data_0924</uri><term>Sequence trace</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_0925</uri><term>Sequence assembly</term></data><format><uri>http://edamontology.org/format_2561</uri><term>Sequence assembly format (text)</term></format><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function><link><url>https://github.com/Nextomics/NextDenovo/issues</url><type>Issue tracker</type></link><download><url>https://github.com/Nextomics/NextDenovo/releases/tag/2.5.2</url><type>Source code</type></download><documentation><url>https://nextdenovo.readthedocs.io/en/latest/</url><type>User manual</type></documentation><publication><doi>10.1101/2023.03.09.531669.</doi></publication><credit><name>Nextomics</name><email>support@nextomics.org</email><typeEntity>Institute</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>quickmerge</name><description>Quickmerge is a program that uses complementary information from genomes assembled with long reads in order to improve contiguity, and works with assemblies derived from both Pacific Biosciences or Oxford Nanopore. Quickmerge will even work with hybrid assemblies made by combining long reads and Illumina short reads.</description><homepage>https://github.com/mahulchak/quickmerge</homepage><biotoolsID>quickmerge</biotoolsID><biotoolsCURIE>biotools:quickmerge</biotoolsCURIE><version>v.0.3</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_3175</uri><term>Structural variation</term></topic><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><topic><uri>http://edamontology.org/topic_2885</uri><term>DNA polymorphism</term></topic><topic><uri>http://edamontology.org/topic_3673</uri><term>Whole genome sequencing</term></topic><topic><uri>http://edamontology.org/topic_0625</uri><term>Genotype and phenotype</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>C++</language><language>C</language><license>GPL-3.0</license><collectionID>ONTeater</collectionID><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0525</uri><term>Genome assembly</term></operation><operation><uri>http://edamontology.org/operation_3216</uri><term>Scaffolding</term></operation><operation><uri>http://edamontology.org/operation_0524</uri><term>De-novo assembly</term></operation><operation><uri>http://edamontology.org/operation_3196</uri><term>Genotyping</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><note>Runs whole merge process on an input assembly.
Assembly 2 will be used to fill gaps in assembly 1.</note><cmd>merge_wrapper.py -pre output_prefix assembly_1.fa assembly_2.fa</cmd></function><link><url>https://github.com/mahulchak/quickmerge/issues</url><type>Issue tracker</type></link><link><url>https://github.com/mahulchak/quickmerge</url><type>Repository</type></link><publication><doi>10.1534/g3.118.200162</doi><pmid>30018084</pmid><pmcid>PMC6169397</pmcid><type>Primary</type></publication><credit><name>Mahul Chakraborty</name><url>https://mahulchakraborty.wordpress.com/</url><orcidid>https://orcid.org/0000-0003-2414-9187</orcidid><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool><tool><name>Racon</name><description>Consensus module for raw de novo DNA assembly of long uncorrected reads

Racon is intended as a standalone consensus module to correct raw contigs generated by rapid assembly methods which do not include a consensus step. The goal of Racon is to generate genomic consensus which is of similar or better quality compared to the output generated by assembly methods which employ both error correction and consensus steps, while providing a speedup of several times compared to those methods. It supports data produced by both Pacific Biosciences and Oxford Nanopore Technologies.</description><homepage>https://github.com/isovic/racon</homepage><biotoolsID>Racon</biotoolsID><biotoolsCURIE>biotools:Racon</biotoolsCURIE><topic><uri>http://edamontology.org/topic_3673</uri><term>Whole genome sequencing</term></topic><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>C++</language><language>Python</language><license>MIT</license><collectionID>ONTeater</collectionID><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0525</uri><term>Genome assembly</term></operation><operation><uri>http://edamontology.org/operation_0523</uri><term>Mapping assembly</term></operation><input><data><uri>http://edamontology.org/data_1383</uri><term>Nucleic acid sequence alignment</term></data><format><uri>http://edamontology.org/format_2572</uri><term>BAM</term></format><format><uri>http://edamontology.org/format_2573</uri><term>SAM</term></format></input><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output><note># The mapping file can be generated with any mapping program - eg, bwa-mem or minimap2.
# The following is an example using minimap2 with ONT data
minimap2 assembly.fa-ax map-ont reads.fa &gt; mapped_reads.sam</note><cmd>racon -u reads.fa mapped_reads.sam assembly.fa &gt; assembly_racon.fa</cmd></function><link><url>https://github.com/isovic/racon</url><type>Repository</type></link><link><url>https://github.com/isovic/racon/issues</url><type>Issue tracker</type></link><publication><doi>10.3390/plants8080270</doi><pmid>31390788</pmid><pmcid>PMC6724115</pmcid></publication><credit><name>Chon-Sik Kang</name><email>cskang@korea.kr</email><typeEntity>Person</typeEntity></credit><credit><name>Changsoo Kim</name><email>changsookim@cnu.ac.kr</email><orcidid>https://orcid.org/0000-0002-3596-2934</orcidid><typeEntity>Person</typeEntity></credit></tool><tool><name>Flye</name><description>Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The package represents a complete pipeline: it takes raw PB / ONT reads as input and outputs polished contigs.</description><homepage>https://github.com/fenderglass/Flye</homepage><biotoolsID>Flye</biotoolsID><biotoolsCURIE>biotools:Flye</biotoolsCURIE><version>2.9.6</version><toolType>Command-line tool</toolType><toolType>Workflow</toolType><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><topic><uri>http://edamontology.org/topic_3174</uri><term>Metagenomics</term></topic><topic><uri>http://edamontology.org/topic_3673</uri><term>Whole genome sequencing</term></topic><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><operatingSystem>Mac</operatingSystem><operatingSystem>Linux</operatingSystem><language>C++</language><language>Python</language><language>C</language><license>BSD-3-Clause</license><collectionID>ONTeater</collectionID><cost>Free of charge</cost><accessibility>Open access</accessibility><function><operation><uri>http://edamontology.org/operation_0525</uri><term>Genome assembly</term></operation><operation><uri>http://edamontology.org/operation_0524</uri><term>De-novo assembly</term></operation><operation><uri>http://edamontology.org/operation_0523</uri><term>Mapping assembly</term></operation><operation><uri>http://edamontology.org/operation_3730</uri><term>Cross-assembly</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1930</uri><term>FASTQ</term></format></input><output><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></output></function><link><url>https://github.com/fenderglass/Flye/issues</url><type>Issue tracker</type></link><link><url>https://github.com/mikolmogorov/Flye</url><type>Repository</type></link><publication><doi>10.1099/mgen.0.000294</doi><pmid>31483244</pmid><pmcid>PMC6807382</pmcid><type>Usage</type></publication><publication><doi>10.1038/s41587-019-0072-8</doi><pmid>30936562</pmid><type>Primary</type></publication><publication><doi>10.1038/s41592-020-00971-x</doi><pmid>33020656</pmid><pmcid>PMC10699202</pmcid><type>Primary</type></publication><credit><name>Mikhail Kolmogorov</name><email>fenderglass@gmail.com</email><typeEntity>Person</typeEntity><typeRole>Developer</typeRole><typeRole>Support</typeRole></credit><credit><name>Yu Lin</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit><credit><name>Jeffrey Yuan</name><typeEntity>Person</typeEntity><typeRole>Developer</typeRole></credit></tool><tool><name>BUSCO</name><description>Provides measures for quantitative assessment of genome assembly, gene set, and transcriptome completeness based on evolutionarily informed expectations of gene content from near-universal single-copy orthologs.</description><homepage>https://busco.ezlab.org/</homepage><biotoolsID>busco</biotoolsID><biotoolsCURIE>biotools:busco</biotoolsCURIE><version>1</version><toolType>Command-line tool</toolType><topic><uri>http://edamontology.org/topic_0196</uri><term>Sequence assembly</term></topic><topic><uri>http://edamontology.org/topic_0622</uri><term>Genomics</term></topic><topic><uri>http://edamontology.org/topic_3308</uri><term>Transcriptomics</term></topic><topic><uri>http://edamontology.org/topic_0080</uri><term>Sequence analysis</term></topic><operatingSystem>Linux</operatingSystem><language>Python</language><elixirPlatform>Tools</elixirPlatform><elixirNode>Switzerland</elixirNode><function><operation><uri>http://edamontology.org/operation_3180</uri><term>Sequence assembly validation</term></operation><input><data><uri>http://edamontology.org/data_1234</uri><term>Sequence set (nucleic acid)</term></data><format><uri>http://edamontology.org/format_1929</uri><term>FASTA</term></format></input><output><data><uri>http://edamontology.org/data_2955</uri><term>Sequence report</term></data></output></function><documentation><url>https://busco.ezlab.org/busco_userguide.html</url><type>User manual</type></documentation><publication><doi>10.1093/bioinformatics/btv351</doi><pmid>26059717</pmid></publication><publication><doi>10.1002/cpz1.323</doi><pmid>34936221</pmid></publication><publication><doi>10.1093/molbev/msx319</doi><pmid>29220515</pmid><pmcid>PMC5850278</pmcid></publication><publication><doi>10.1093/molbev/msab199</doi><pmid>34320186</pmid><pmcid>PMC8476166</pmcid></publication><credit><name>SIB Swiss Institute of Bioinformatics</name><typeEntity>Institute</typeEntity><typeRole>Provider</typeRole></credit><credit><email>evgeny.zdobnov@unige.ch</email><typeEntity>Person</typeEntity><typeRole>Primary contact</typeRole></credit></tool></tools>