==Software==
{{Category see also|Data mining and machine learning software}}

===Free open-source data mining software and applications===
The following applications are available under free/open-source licenses. Their source code is also publicly available.
* [[Carrot2]]: Text and search results clustering framework.
* [[Chemicalize.org]]: A chemical structure miner and web search engine.
* [[ELKI]]: A university research project with advanced [[cluster analysis]] and [[outlier detection]] methods written in the [[Java (programming language)|Java]] language.
* [[General Architecture for Text Engineering|GATE]]: A [[natural language processing]] and language engineering tool.
* [[KNIME]]: The Konstanz Information Miner, a user-friendly and comprehensive data analytics framework.
* [[MOA (Massive Online Analysis)|Massive Online Analysis (MOA)]]: A tool for real-time big data stream mining with concept drift, written in the [[Java (programming language)|Java]] programming language.
* [[Multi expression programming|MEPX]]: Cross-platform tool for regression and classification problems based on a Genetic Programming variant.
* [[mlpack]]: A collection of ready-to-use machine learning algorithms written in the [[C++]] language.
* [[NLTK]] ([[Natural Language Toolkit]]): A suite of libraries and programs for symbolic and statistical natural language processing (NLP) for the [[Python (programming language)|Python]] language.
* [[OpenNN]]: Open [[Artificial neural network|neural networks]] library.
* [[Orange (software)|Orange]]: A component-based data mining and [[machine learning]] software suite written in the [[Python (programming language)|Python]] language.
* [[PSPP]]: Data mining and statistics software under the GNU Project, similar to [[SPSS]].
* [[R (programming language)|R]]: A [[programming language]] and software environment for [[statistical]] computing, data mining, and graphics. It is part of the [[GNU Project]].
* [[scikit-learn]]: An open-source machine learning library for the [[Python (programming language)|Python]] programming language.
* [[Torch (machine learning)|Torch]]: An [[open-source]] [[deep learning]] library for the [[Lua (programming language)|Lua]] programming language and [[scientific computing]] framework with wide support for [[machine learning]] algorithms.
* [[UIMA]]: The UIMA (Unstructured Information Management Architecture) is a component framework for analyzing unstructured content such as text, audio and video, originally developed by IBM.
* [[Weka (machine learning)|Weka]]: A suite of machine learning software applications written in the [[Java (programming language)|Java]] programming language.

===Proprietary data-mining software and applications===
The following applications are available under proprietary licenses.
* [[Angoss]] KnowledgeSTUDIO: Data mining tool.
* [[LIONsolver]]: An integrated software application for data mining, business intelligence, and modeling that implements the Learning and Intelligent OptimizatioN (LION) approach.
* [[PolyAnalyst]]: Data and text mining software by Megaputer Intelligence.
* [[Microsoft Analysis Services]]: Data mining software provided by [[Microsoft]].
* [[NetOwl]]: Suite of multilingual text and entity analytics products that enable data mining.
* [[Oracle Data Mining]]: Data mining software by [[Oracle Corporation]].
* [[PSeven]]: Platform for automation of engineering simulation and analysis, multidisciplinary optimization and data mining provided by [[DATADVANCE]].
* [[Qlucore]] Omics Explorer: Data mining software.
* [[RapidMiner]]: An environment for [[machine learning]] and data mining experiments. <!-- Latest version is NOT opensource -->
* [[SAS (software)#Components|SAS Enterprise Miner]]: Data mining software provided by the [[SAS Institute]].
* [[SPSS Modeler]]: Data mining software provided by [[IBM]].
* [[STATISTICA]] Data Miner: Data mining software provided by [[StatSoft]].
* [[Tanagra (machine learning)|Tanagra]]: Visualisation-oriented data mining software, also for teaching.
* [[Vertica]]: Data mining software provided by [[Hewlett-Packard]].
* [[Google Cloud Platform]]: Automated custom ML models managed by [[Google]].
* [[Amazon SageMaker]]: Managed service provided by [[Amazon.com|Amazon]] for creating and productionising custom ML models.