机器学习工具推荐
2015-10-23 15:34
746 查看
原网址:http://www.demnag.com/b/java-machine-learning-tools-libraries-cm570/
This is a list of 25 Java Machine learning tools & libraries.
Weka has a collection of machine learning
algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization.
Massive Online Analysis (MOA) is a popular open
source framework for data stream mining, with a very active growing community. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for
evaluation. Related to the WEKA project, MOA is also written in Java, while scaling to more demanding problems.
The MEKA project provides
an open source implementation of methods for multi-label learning and evaluation. In multi-label classification, we want to predict multiple output variables for each input instance. This different from the 'standard' case which involves only a single target
variable. MEKA is based on the WEKA Machine Learning Toolkit.
The Advanced Data mining And Machine learning
System (ADAMS) is a novel, flexible workflow engine aimed at quickly building and maintaining real-world, complex knowledge workflows, released under GPLv3.
Environment for Developing KDD-Applications Supported by Index-Structure (ELKI)
is an open source (AGPLv3) data mining software written in Java. The focus of ELKI is research in algorithms, with an emphasis on unsupervised methods in cluster analysis and outlier detection.
Mallet is a java machine learning toolkit for textual
document. Mallet supports classification algorithms like maximum entropy, naive bayes and decision tree for classification.
Encog is an advanced machine learning framework
which supports Support Vector Machines,Artificial Neural Networks, Genetic Programming, Bayesian Networks, Hidden Markov Models, Genetic Programming and Genetic Algorithms are supported.
The Datumbox Machine Learning
Framework is an open-source framework written in Java which allows the rapid development Machine Learning and Statistical applications. The main focus of the framework is to include a large number of machine learning algorithms & statistical tests and being
able to handle medium-large sized datasets.
Deeplearning4j is the first commercial-grade, open-source,
distributed deep-learning library written for Java and Scala. It is designed to be used in business environments, rather than as a research tool.
Mahout is a machine learning framework with built in
algorithms. Mahout-Samsara helps people create their own math while providing some off-the-shelf algorithm implementations.
Rapid Miner was developed at Technical University of
Dortmund, Germany. It provides a GUI and a Java API for developing your own applications. It provides data handling, visualization and modeling with machine learning algorithms.
Apache SAMOA is a machine learning (ML) framework
that contains a programing abstraction for distributed streaming ML algorithms and enables development of new ML algorithms without directly dealing with the complexity of underlying distributed stream processing engines (DSPEe, such as Apache Storm, Apache
S4, and Apache Samza). Its users can develop distributed streaming ML algorithms once and execute them on multiple DSPEs.
Neuroph simplifies the development of neural
networks by providing Java neural network library and GUI tool that supports creating, training and saving neural networks.
Oryx 2 is a realization of the lambda architecture built on Apache
Spark and Apache Kafka, but with specialization for real-time large scale machine learning. It is a framework for building applications, but also includes packaged, end-to-end applications for collaborative filtering, classification, regression and clustering.
Stanford Classifier is a machine
learning tool that will take data items and place them into one of k classes. A probabilistic classifier, like this one, can also give a probability distribution over the class assignment for a data item. This software is a Java implementation of a maximum
entropy classifier.
Cortical.io is a Retina API fast, precise and brain like
algorithm that enables NLP.
JSAT is a library for quickly
getting started with Machine Learning problems. It is developed in my free time, and made available for use under the GPL 3. Part of the library is for self education, as such - all code is self contained. JSAT has no external dependencies, and is pure Java.
N-Dimensional Arrays for Java (ND4J) is a scientific computing
libraries for the JVM. They are meant to be used in production environments, which means routines are designed to run fast with minimum RAM requirements.
The Java Machine Learning Library is
a set of reference implementations of machine learning algorithms. These algorithms are well documented, both in the source code as on the documentation site.It is mostly written in Java.
Java-ML is a Java API with a collection of machine
learning algorithms implemented in Java. It only provides a standard interface for algorithms.
MLlib (Spark) is Apache Spark's scalable machine
learning library. Although Java, the library and the platform support Java, Scala and Python bindings. The library is new and the list of algorithms is long.
H2O is a machine learning API for smarter applications.
It scales statistics, machine learning, and math over big data. H2O is extensible and individual can build blocks using simple math legos in the core.
WalnutiQ is a object oriented model of partial
human brain with 1 theorized common learning algorithm (work in progress towards a simplistic model of a strong emotional A.I.)
RankLib is a library of learning
to rank algorithms. Currently eight popular algorithms have been implemented.
htm.java (Hierarchical Temporal Memory implementation
in Java) is a Java port of the Numenta Platform for Intelligent Computing.
This is a list of 25 Java Machine learning tools & libraries.
Weka has a collection of machine learning
algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization.
Massive Online Analysis (MOA) is a popular open
source framework for data stream mining, with a very active growing community. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for
evaluation. Related to the WEKA project, MOA is also written in Java, while scaling to more demanding problems.
The MEKA project provides
an open source implementation of methods for multi-label learning and evaluation. In multi-label classification, we want to predict multiple output variables for each input instance. This different from the 'standard' case which involves only a single target
variable. MEKA is based on the WEKA Machine Learning Toolkit.
The Advanced Data mining And Machine learning
System (ADAMS) is a novel, flexible workflow engine aimed at quickly building and maintaining real-world, complex knowledge workflows, released under GPLv3.
Environment for Developing KDD-Applications Supported by Index-Structure (ELKI)
is an open source (AGPLv3) data mining software written in Java. The focus of ELKI is research in algorithms, with an emphasis on unsupervised methods in cluster analysis and outlier detection.
Mallet is a java machine learning toolkit for textual
document. Mallet supports classification algorithms like maximum entropy, naive bayes and decision tree for classification.
Encog is an advanced machine learning framework
which supports Support Vector Machines,Artificial Neural Networks, Genetic Programming, Bayesian Networks, Hidden Markov Models, Genetic Programming and Genetic Algorithms are supported.
The Datumbox Machine Learning
Framework is an open-source framework written in Java which allows the rapid development Machine Learning and Statistical applications. The main focus of the framework is to include a large number of machine learning algorithms & statistical tests and being
able to handle medium-large sized datasets.
Deeplearning4j is the first commercial-grade, open-source,
distributed deep-learning library written for Java and Scala. It is designed to be used in business environments, rather than as a research tool.
Mahout is a machine learning framework with built in
algorithms. Mahout-Samsara helps people create their own math while providing some off-the-shelf algorithm implementations.
Rapid Miner was developed at Technical University of
Dortmund, Germany. It provides a GUI and a Java API for developing your own applications. It provides data handling, visualization and modeling with machine learning algorithms.
Apache SAMOA is a machine learning (ML) framework
that contains a programing abstraction for distributed streaming ML algorithms and enables development of new ML algorithms without directly dealing with the complexity of underlying distributed stream processing engines (DSPEe, such as Apache Storm, Apache
S4, and Apache Samza). Its users can develop distributed streaming ML algorithms once and execute them on multiple DSPEs.
Neuroph simplifies the development of neural
networks by providing Java neural network library and GUI tool that supports creating, training and saving neural networks.
Oryx 2 is a realization of the lambda architecture built on Apache
Spark and Apache Kafka, but with specialization for real-time large scale machine learning. It is a framework for building applications, but also includes packaged, end-to-end applications for collaborative filtering, classification, regression and clustering.
Stanford Classifier is a machine
learning tool that will take data items and place them into one of k classes. A probabilistic classifier, like this one, can also give a probability distribution over the class assignment for a data item. This software is a Java implementation of a maximum
entropy classifier.
Cortical.io is a Retina API fast, precise and brain like
algorithm that enables NLP.
JSAT is a library for quickly
getting started with Machine Learning problems. It is developed in my free time, and made available for use under the GPL 3. Part of the library is for self education, as such - all code is self contained. JSAT has no external dependencies, and is pure Java.
N-Dimensional Arrays for Java (ND4J) is a scientific computing
libraries for the JVM. They are meant to be used in production environments, which means routines are designed to run fast with minimum RAM requirements.
The Java Machine Learning Library is
a set of reference implementations of machine learning algorithms. These algorithms are well documented, both in the source code as on the documentation site.It is mostly written in Java.
Java-ML is a Java API with a collection of machine
learning algorithms implemented in Java. It only provides a standard interface for algorithms.
MLlib (Spark) is Apache Spark's scalable machine
learning library. Although Java, the library and the platform support Java, Scala and Python bindings. The library is new and the list of algorithms is long.
H2O is a machine learning API for smarter applications.
It scales statistics, machine learning, and math over big data. H2O is extensible and individual can build blocks using simple math legos in the core.
WalnutiQ is a object oriented model of partial
human brain with 1 theorized common learning algorithm (work in progress towards a simplistic model of a strong emotional A.I.)
RankLib is a library of learning
to rank algorithms. Currently eight popular algorithms have been implemented.
htm.java (Hierarchical Temporal Memory implementation
in Java) is a Java port of the Numenta Platform for Intelligent Computing.
相关文章推荐
- Struts2 action配置
- JavaWeb + mySQL + Android 实现Android简单的注册登录
- 百度云管家上传大文件网络异常时自动重启
- 查找页面<img/>
- JavaScript之 ------ 基本运用语法(变量、运算符、语句)
- spring
- Java IO流分析整理
- HDU1142 (Dijkstra+记忆化搜索)
- Java语言基础(移位运算符)
- 原生的强大DOM选择器querySelector
- 用sql查询当天,一周,一个月的数据
- Android 通过网页打开自己的APP(scheme)
- 如何实现MindMapper中的幻灯片功能的合理运用
- SearchView 学习_0
- 利用NSOperation进行异步图片下载——设置UITabView数据,图片下载,占位图。解决异步下载和Cell重用造成的图片设置混乱。和多次重复下载的问题
- java中的代理模式 用接口
- 动态的计算带有行间距的字段的长高
- SVN服务器搭建和使用(二)
- VS2010 生成Xml格式的注释文档
- StringBuilder清空方式