[轉(zhuǎn)]11個(gè)著名的開源機(jī)器學(xué)習(xí)工具

Open source machine learning software makes it easier to implement machine learning solutions on single computers and at scale, and the diversity of packages provide more options for implementers.

Accord Framework/AForge.net

Accord, a machine learning and signal processing framework for .Net, is an extension of a previous project in the same vein,AForge.net. A set of algorithms for vision processing are included; it operates on image streams (such as video) and can be used to implement such functions as the tracking of moving objects. Accord also includes libraries that provide a more conventional gamut of machine learning functions, from neural networks to decision-tree systems.

Github:github.com/accord-net/framework/

Cloudera Oryx

Yet another machine learning project designed for Hadoop, Oryx comes courtesy of the creators of the Cloudera Hadoop distribution. The name on the label isn’t the only detail that sets Oryx apart: Per Cloudera’s emphasis on analyzing live streaming data by way of the Spark project, Oryx is designed to allow machine learning models to be deployed on real-time streamed data, enabling projects like real-time spam filters or recommendation engines.

Github:github.com/cloudera/oryx

ConvNetJS

As the name implies, ConvNetJS provides neural network machine learning libraries for use in JavaScript, facilitating use of the browser as a data workbench. An NPM version is also available for those using Node.js.

Github:github.com/karpathy/convnetjs

CUDA-Convnet

By now most everyone knows how GPUs can crunch certain problems faster than CPUs. But applications don’t automatically take advantage of GPU acceleration; they have to be specifically written to do so. CUDA-Convnet is a machine learning library for neural-network applications, written in C++ to exploit the Nvidia’s CUDA GPU processing technology (CUDA boards of at least the Fermi generation are required).

GoLearn

Google’s Go language has been in the wild for only five years, but has started to enjoy wider use, due to a growing collection of libraries. GoLearn was created to address the lack of an all-in-one machine learning library for Go; the goal is “simplicity paired with customizability,” according to developer Stephen Witworth.

Github:github.com/sjwhitworth/golearn

H2O

0xdata’s H2O's algorithms are geared for business processes -- fraud or trend predictions, for instance -- rather than, say, image analysis. H2O can interact in a stand-alone fashion with HDFS stores, on top of YARN, in MapReduce, or directly in an Amazon EC2 instance.

Github:github.com/h2oai/h2o

Mahout

The Mahout framework has long been tied to Hadoop, but many of the algorithms under its umbrella can also run as-is outside Hadoop. They're useful for stand-alone applications that might eventually be migrated into Hadoop or for Hadoop projects that could be spun off into their own stand-alone applications.

MLlib

Apache’s own machine learning library for Spark and Hadoop, MLlib boasts a gamut of common algorithms and useful data types, designed to run at speed and scale. As you’d expect with any Hadoop project, Java is the primary language for working in MLlib, but Python users can connect MLlib with the NumPy library (also used in scikit-learn), and Scala users can write code against MLlib.

Scikit-learn

Python has become a go-to programming language for math, science, and statistics due to its ease of adoption and the breadth of libraries available for nearly any application. Scikit-learn leverages this breadth by building on top of several existing Python packages -- NumPy, SciPy, and matplotlib -- for math and science work. The resulting libraries can be used either for interactive “workbench” applications or be embedded into other software and reused.

GitHub:github.com/scikit-learn/scikit-learn

Shogun

Among the oldest, most venerable of machine learning libraries, Shogun was created in 1999 and written in C++, but isn’t limited to working in C++. Thanks to the SWIG library, Shogun can be used transparently in such languages and environments: as Java, Python, C#, Ruby, R, Lua, Octave, and Matlab.

Github:github.com/shogun-toolbox/shogun

Weka

Weka, a product of the University of Waikato, New Zealand, collects a set of Java machine learning algorithms engineered specifically for data mining. This GNU GPLv3-licensed collection has a package system to extend its functionality, with both official and unofficial packages available.

Original:http://www.networkworld.com/article/2855100/opensource-subnet/11-open-source-tools-machine-learning.html

最后編輯于
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時(shí)請(qǐng)結(jié)合常識(shí)與多方信息審慎甄別。
平臺(tái)聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點(diǎn),簡(jiǎn)書系信息發(fā)布平臺(tái),僅提供信息存儲(chǔ)服務(wù)。

相關(guān)閱讀更多精彩內(nèi)容

  • 一,概念 1,數(shù)字證書的概念 數(shù)字證書是由權(quán)威公正的第三方機(jī)構(gòu)即CA中心簽發(fā)的,以數(shù)字證書為核心的加密技術(shù)可以對(duì)網(wǎng)...
    黃曉星閱讀 4,798評(píng)論 0 0
  • 這是我欲封天中的一個(gè)橋段,令我印象很深,摘錄出來,以示敬仰。 時(shí)光流逝,出生時(shí)祥瑞齊開的陳雷早已成長(zhǎng)到了少年,雖然...
    三頁薄紙閱讀 456評(píng)論 1 0
  • 今日運(yùn)動(dòng): 2992步,30個(gè)仰臥起坐。未完成目標(biāo)。明天4000步補(bǔ)上。 今日嘉許 一家去送小倍,很享受彼此在一起...
    阿點(diǎn)的親子芳療會(huì)客廳閱讀 177評(píng)論 0 0
  • 人生在世,總會(huì)遇到困難,這是每個(gè)人生命中都會(huì)遇到的。每個(gè)人對(duì)于困難的態(tài)度不同,解決困難的方法也不同。有的人很彷徨,...
    安瑗Annie閱讀 523評(píng)論 2 4

友情鏈接更多精彩內(nèi)容