Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
software [2019/01/11 09:43]
Jaroslaw Zola
software [2019/06/24 15:53]
Jaroslaw Zola
Line 5: Line 5:
   * [[https://gitlab.com/SCoRe-Group/SABNAtk|SABNAtk]] Toolkit for fast counting over categorical data. This toolkit can be used to power any application that involves counting of data configurations, e.g., to estimate likelihoods, evaluate model scoring functions, etc. Currently, SABNAtk is the workhorse supporting SABNA.   * [[https://gitlab.com/SCoRe-Group/SABNAtk|SABNAtk]] Toolkit for fast counting over categorical data. This toolkit can be used to power any application that involves counting of data configurations, e.g., to estimate likelihoods, evaluate model scoring functions, etc. Currently, SABNAtk is the workhorse supporting SABNA.
   * [[https://gitlab.com/SCoRe-Group/SABNA-Release|SABNA]] Scalable Accelerated Bayesian Network Analytics. This very actively developed software toolkit provides a set of sequential and parallel tools for structure learning of Bayesian networks. It is designed to provide exact solutions on large-scale data in acceptable time limits.   * [[https://gitlab.com/SCoRe-Group/SABNA-Release|SABNA]] Scalable Accelerated Bayesian Network Analytics. This very actively developed software toolkit provides a set of sequential and parallel tools for structure learning of Bayesian networks. It is designed to provide exact solutions on large-scale data in acceptable time limits.
 +  * [[https://gitlab.com/SCoRe-Group/APSPark|APSPark]] Efficient and scalable All-Pairs Shortest-Path Solver for Apache Spark. On a modest Spark cluster (e.g., 1024 Intel Xeon cores), the solver can handle arbitrary undirected graphs with over 200,000 vertices.
   * [[http://www.jzola.org/elastic|ELaSTIC]] is a software suite for a rapid identification and clustering of similar sequences from large-scale biological sequence collections. At its core is an efficient MinHash-based strategy to detect similar sequence pairs without aligning all sequences against each other. It is designed to work with data sets consisting of millions of DNA/RNA or amino acid strings, using various alignment criteria.    * [[http://www.jzola.org/elastic|ELaSTIC]] is a software suite for a rapid identification and clustering of similar sequences from large-scale biological sequence collections. At its core is an efficient MinHash-based strategy to detect similar sequence pairs without aligning all sequences against each other. It is designed to work with data sets consisting of millions of DNA/RNA or amino acid strings, using various alignment criteria. 
  • Show page