R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R.
Tool Description as in https://www.r-project.org/about.html
Image Credit: https://rattle.togaware.com/rattle-screenshots.html
RapidMiner
RapidMiner makes data science teams more productive through an open source platform for data prep, machine learning, and model deployment.
Tool Description as in https://rapidminer.com/
IBM SPSS Modeler
IBM SPSS Modeler is a predictive analytics platform that helps you build accurate predictive models quickly and deliver predictive intelligence to individuals, groups, systems and the enterprise. It provides a range of advanced algorithms and analysis techniques, including text analytics, entity analytics, decision management and optimization to deliver insights in near real-time.
Tool Description as in https://www.ibm.com/us-en/marketplace/spss-modeler
Image Credit: https://www.ibm.com/us-en/marketplace/spss-modeler
SAS Data Mining
Descriptive and predictive modeling provide insights that drive better decision making. Now you can streamline the data mining process to develop models quickly. Understand key relationships. And find the patterns that matter most.
Tool Description as in https://www.sas.com/en_id/software/analytics/enterprise-miner.html
Python
Welcome! Are you completely new to programming? If not, then we presume you will be looking for information about why and how to get started with Python. Fortunately, an experienced programmer in any programming language (whatever it may be) can pick up Python very quickly. Whether you’re new to programming or an experienced developer, it’s easy to learn and use Python. Python source code and installers are available for download for all versions.
Tool Description as in https://www.python.org/about/gettingstarted/
Image Credit: https://twitter.com/_odisseus/status/831032625464754177
Orange
Open source machine learning and data visualization for novice and expert. Interactive data analysis workflows with a large toolbox.
In text preprocessing, for example, the most frequent words in a set of texts are conjunctions (‘and’, ‘or’) and prepositions (‘in’, ‘of’), as they are in almost every English text. These frequent and uninteresting words need to be removed to get to the interesting part. Punctuation is removed by defining what counts as a token.
Tool Description as in https://blog.biolab.si/2017/06/19/text-preprocessing/
Image Credit: https://orange.biolab.si/
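The text-preprocessing steps described for Orange above (tokenizing so that punctuation is dropped, then removing frequent but uninteresting words) can be sketched in plain Python. This is a minimal illustration and not Orange's own implementation: the stop-word list, the token pattern, the function names and the sample sentence are all assumptions made for the example.

```python
import re
from collections import Counter

# Illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"and", "or", "in", "of", "the", "a", "to", "is"}

# A token is defined as a run of letters, optionally with internal apostrophes.
# Punctuation, digits and whitespace never match, so they are dropped.
TOKEN_PATTERN = re.compile(r"[a-z]+(?:'[a-z]+)*")


def tokenize(text):
    """Lower-case the text and return its word tokens, without punctuation."""
    return TOKEN_PATTERN.findall(text.lower())


def remove_stop_words(tokens, stop_words=STOP_WORDS):
    """Keep only the tokens that are not in the stop-word set."""
    return [token for token in tokens if token not in stop_words]


def top_words(text, n=3):
    """Return the n most frequent tokens after stop-word removal."""
    return Counter(remove_stop_words(tokenize(text))).most_common(n)


# Hypothetical sample text, used only to show the effect of each step.
sample = "The cat and the dog sat in the shade of the tree, and the cat and the dog slept."

# Before filtering, the counts are dominated by 'the' and 'and'.
print(Counter(tokenize(sample)).most_common(2))

# After filtering, the content words come to the top.
print(top_words(sample, 2))
```

Defining the token pattern up front is what removes the punctuation: anything the pattern does not match simply never becomes a token, so no separate clean-up pass is needed.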
KNIME
KNIME Analytics Platform is the leading open solution for data-driven innovation, helping you discover the potential hidden in your data, mine for fresh insights, or predict new futures. Our enterprise-grade, open source platform is fast to deploy, easy to scale and intuitive to learn.
Tool Description as in https://www.knime.org/knime-analytics-platform
Image Credit: https://www.knime.org/knime-analytics-platform
Apache Spark
Apache Spark has an advanced DAG execution engine that supports acyclic data flow and in-memory computing. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python and R shells. Spark runs on Hadoop, Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Cassandra, HBase, and S3.
Tool Description as in http://spark.apache.org/
Image Credit: http://spark.apache.org/
H2O
H2O is the world’s leading open source deep learning platform. H2O is used by over 100,000 data scientists and more than 10,000 organizations around the world. H2O.ai is developing mission critical data products for some of the world’s most admired and influential companies. Every enterprise needs a digital brain and H2O.ai is making it possible.
Tool Description as in https://www.h2o.ai/
Weka
Weka is a Java based free and open source software licensed under the GNU GPL and available for use on Linux, Mac OS X and Windows. It comprises a collection of machine learning algorithms for data mining. It packages tools for data pre-processing, classification, regression, clustering, association rules and visualisation.
Tool Description as in http://opensourceforu.com/2017/03/top-10-open-source-data-mining-tools/
Source: Raghunandan Reddy Alugubelli