This is an old revision of the document!


Software


  • ELaSTIC is a software suite for a rapid identification and clustering of similar sequences from large-scale biological sequence collections. At its core is an efficient MinHash-based strategy to detect similar sequence pairs without aligning all sequences against each other. It is designed to work with data sets consisting of millions of DNA/RNA or amino acid strings, using various alignment criteria.