Data Mining tools

R

R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R.

R

Tool Description as in https://www.r-project.org/about.html

Image Credit: https://rattle.togaware.com/rattle-screenshots.html

download-R

RapidMiner

RapidMiner makes data science teams more productive through an open source platform for data prep, machine learning, and model deployment.

RapidMiner

Tool Description as in https://rapidminer.com/

Image Credit: https://www.getapp.com/business-intelligence-analytics-software/a/rapidminer/

download-RapidMiner

IMB SPSS Modeler

IBM SPSS Modeler is a predictive analytics platform that helps you build accurate predictive models quickly and deliver predictive intelligence to individuals, groups, systems and the enterprise. It provides a range of advanced algorithms and analysis techniques, including text analytics, entity analytics, decision management and optimization to deliver insights in near real-time.

IMB-SPSS-Modeler

Tool Description as in https://www.ibm.com/us-en/marketplace/spss-modeler

Image Credit: https://www.ibm.com/us-en/marketplace/spss-modeler

download-IMB-SPSS-Modeler

SAS Data Mining

Descriptive and predictive modeling provide insights that drive better decision making. Now you can streamline the data mining process to develop models quickly. Understand key relationships. And find the patterns that matter most.

SAS-Data-Mining

Tool Description as in https://www.sas.com/en_id/software/analytics/enterprise-miner.html

Image Credit: https://www.sas.com/en_id/software/analytics/enterprise-miner.html#m=screenshot

download-SAS-Data-Mining

Python

Welcome! Are you completely new to programming? If not then we presume you will be looking for information about why and how to get started with Python. Fortunately an experienced programmer in any programming language (whatever it may be) can pick up Python very quickly. Whether you’re new to programming or an experienced developer, it’s easy to learn and use Python. Python source code and installers are available for download for all versions.

Python

Tool Description as in https://www.python.org/about/gettingstarted/

Image Credit: https://twitter.com/_odisseus/status/831032625464754177

download-Python

Orange

Open source machine learning and data visualization for novice and expert. Interactive data analysis workflows with a large toolbox. The most frequent words in these texts are conjunctions (‘and’, ‘or’) and prepositions (‘in’, ‘of’), but so they are in almost every English text in the world. We need to remove these frequent and uninteresting words to get to the interesting part. We remove the punctuation by defining our tokens.

Orange

Tool Description as in https://blog.biolab.si/2017/06/19/text-preprocessing/

Image Credit: https://orange.biolab.si/

download-Orange

KNIME

KNIME Analytics Platform is the leading open solution for data-driven innovation, helping you discover the potential hidden in your data, mine for fresh insights, or predict new futures. Our enterprise-grade, open source platform is fast to deploy, easy to scale and intuitive to learn.

KNIME

Tool Description as in https://www.knime.org/knime-analytics-platform

Image Credit: https://www.knime.org/knime-analytics-platform

download-KNIME

Spark

Apache Spark has an advanced DAG execution engine that supports acyclic data flow and in-memory computing. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python and R shells. Spark runs on Hadoop, Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Cassandra, HBase, and S3.

Spark

Tool Description as in http://spark.apache.org/

Image Credit: http://spark.apache.org/

download-Spark

H2O

H2O is the world’s leading open source deep learning platform. H2O is used by over 100,000 data scientists and more than 10,000 organizations around the world. H2O.ai is developing mission critical data products for some of the world’s most admired and influential companies. Every enterprise needs a digital brain and H2O.ai is making it possible.

H2O

Tool Description as in https://www.h2o.ai/

Image Credit: https://bitbook.io/h2o-ai-quick-start-tutorial-for-just-about-anyone/

download-H2O

Weka

Weka is a Java based free and open source software licensed under the GNU GPL and available for use on Linux, Mac OS X and Windows. It comprises a collection of machine learning algorithms for data mining. It packages tools for data pre-processing, classification, regression, clustering, association rules and visualisation.

Weka

Tool Description as in http://opensourceforu.com/2017/03/top-10-open-source-data-mining-tools/

Image Credit: https://www.ibm.com/developerworks/library/os-weka2/index.html

download-Weka

Useful Videos

SAS Data Mining

Source:Raghunandan Reddy Alugubelli

KNIME

Source:KNIMETV

Python

Source:PyTexas

Leave a Reply

Your email address will not be published. Required fields are marked *