Machine Learning Fundamentals with Scala and Apache Spark Training Course
The aim of this course is to provide a basic proficiency in applying Machine Learning methods in practice. Through the use of the Scala programming language and its various libraries, and based on a multitude of practical examples this course teaches how to use the most important building blocks of Machine Learning, how to make data modeling decisions, interpret the outputs of the algorithms and validate the results.
Our goal is to give you the skills to understand and use the most fundamental tools from the Machine Learning toolbox confidently and avoid the common pitfalls of Data Sciences applications.
Course Outline
Introduction to Applied Machine Learning
- Statistical learning vs. Machine learning
- Iteration and evaluation
- Bias-Variance trade-off
Machine Learning with Scala
- Choice of libraries
- Add-on tools
Regression
- Linear regression
- Generalizations and Nonlinearity
- Exercises
Classification
- Bayesian refresher
- Naive Bayes
- Logistic regression
- K-Nearest neighbors
- Exercises
Cross-validation and Resampling
- Cross-validation approaches
- Bootstrap
- Exercises
Unsupervised Learning
- K-means clustering
- Examples
- Challenges of unsupervised learning and beyond K-means
Requirements
Knowledge of Java/Scala programming language. Basic familiarity with statistics and linear algebra is recommended.
Need help picking the right course?
Machine Learning Fundamentals with Scala and Apache Spark Training Course - Booking
Machine Learning Fundamentals with Scala and Apache Spark Training Course - Enquiry
Machine Learning Fundamentals with Scala and Apache Spark - Consultancy Enquiry
Provisonal Upcoming Courses (Contact Us For More Information)
Related Courses
DataRobot
7 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at data scientists and data analysts who wish to automate, evaluate, and manage predictive models using DataRobot's machine learning capabilities.
By the end of this training, participants will be able to:
- Load datasets in DataRobot to analyze, assess, and quality check data.
- Build and train models to identify important variables and meet prediction targets.
- Interpret models to create valuable insights that are useful in making business decisions.
- Monitor and manage models to maintain an optimized prediction performance.
Artificial Intelligence (AI) with H2O
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at technical persons who wish to build machine learning models using algorithms such as GLM, Deep Learning and Random Forests.
By the end of this training, participants will be able to:
- Install and configure H2O.
- Create machine learning models using different popular algorithms.
- Evaluate models based on the type of data and business requirements.
H2O AutoML
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at data scientists who wish to use H2O AutoML to automoate the process of building and selecting the best machine learning algorithm and parameters.
By the end of this training, participants will be able to:
- Automate the machine learning workflow.
- Automatically train and tune many machine learning models within a specified time range.
- Train stacked ensembles to arrive at highly predictive ensemble models.
AutoML with Auto-sklearn
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at machine learning practitioners who wish to use Auto-sklearn to automate the process of selecting and optimizing a machine learning model.
By the end of this training, participants will be able to:
- Automate the process of training highly efficient machine learning models.
- Build highly accurate machine learning models while bypassing the more tedious tasks of selecting, training and testing different models.
- Use the power of machine learning to solve real-world business problems.
AutoML with Auto-Keras
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at data scientists as well as less technical persons who wish to use Auto-Keras to automate the process of selecting and optimizing a machine learning model.
By the end of this training, participants will be able to:
- Automate the process of training highly efficient machine learning models.
- Automatically search for the best parameters for deep learning models.
- Build highly accurate machine learning models.
- Use the power of machine learning to solve real-world business problems.
AdaBoost Python for Machine Learning
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at data scientists and software engineers who wish to use AdaBoost to build boosting algorithms for machine learning with Python.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start building machine learning models with AdaBoost.
- Understand the ensemble learning approach and how to implement adaptive boosting.
- Learn how to build AdaBoost models to boost machine learning algorithms in Python.
- Use hyperparameter tuning to increase the accuracy and performance of AdaBoost models.
Machine Learning with Random Forest
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at data scientists and software engineers who wish to use Random Forest to build machine learning algorithms for large datasets.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start building machine learning models with Random forest.
- Understand the advantages of Random Forest and how to implement it to resolve classification and regression problems.
- Learn how to handle large datasets and interpret multiple decision trees in Random Forest.
- Evaluate and optimize machine learning model performance by tuning the hyperparameters.
Data Mining with Weka
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at beginner to intermediate-level data analysts and data scientists who wish to use Weka to perform data mining tasks.
By the end of this training, participants will be able to:
- Install and configure Weka.
- Understand the Weka environment and workbench.
- Perform data mining tasks using Weka.
Machine Learning for Mobile Apps using Google’s ML Kit
14 HoursThis instructor-led, live training in (online or onsite) is aimed at developers who wish to use Google’s ML Kit to build machine learning models that are optimized for processing on mobile devices.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start developing machine learning features for mobile apps.
- Integrate new machine learning technologies into Android and iOS apps using the ML Kit APIs.
- Enhance and optimize existing apps using the ML Kit SDK for on-device processing and deployment.
AutoML
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at technical persons with a background in machine learning who wish to optimize the machine learning models used for detecting complex patterns in big data.
By the end of this training, participants will be able to:
- Install and evaluate various open source AutoML tools (H2O AutoML, auto-sklearn, TPOT, TensorFlow, PyTorch, Auto-Keras, TPOT, Auto-WEKA, etc.)
- Train high quality machine learning models.
- Efficiently solve different types of supervised machine learning problems.
- Write just the necessary code to initiate the automated machine learning process.
Creating Custom Chatbots with Google AutoML
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at participants with varying levels of expertise who wish to leverage Google's AutoML platform to build customized chatbots for various applications.
By the end of this training, participants will be able to:
- Understand the fundamentals of chatbot development.
- Navigate the Google Cloud Platform and access AutoML.
- Prepare data for training chatbot models.
- Train and evaluate custom chatbot models using AutoML.
- Deploy and integrate chatbots into various platforms and channels.
- Monitor and optimize chatbot performance over time.
Google Cloud AutoML
7 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at data scientists, data analysts, and developers who wish to explore AutoML products and features to create and deploy custom ML training models with minimal effort.
By the end of this training, participants will be able to:
- Explore the AutoML product line to implement different services for various data types.
- Prepare and label datasets to create custom ML models.
- Train and manage models to produce accurate and fair machine learning models.
- Make predictions using trained models to meet business objectives and needs.
Advanced Analytics with RapidMiner
14 HoursThis instructor-led, live training in the UK (online or onsite) is aimed at intermediate-level data analysts who wish to learn how to use RapidMiner to estimate and project values and utilize analytical tools for time series forecasting.
By the end of this training, participants will be able to:
- Learn to apply the CRISP-DM methodology, select appropriate machine learning algorithms, and enhance model construction and performance.
- Use RapidMiner to estimate and project values, and utilize analytical tools for time series forecasting.
RapidMiner for Machine Learning and Predictive Analytics
14 HoursRapidMiner is an open source data science software platform for rapid application prototyping and development. It includes an integrated environment for data preparation, machine learning, deep learning, text mining, and predictive analytics.
In this instructor-led, live training, participants will learn how to use RapidMiner Studio for data preparation, machine learning, and predictive model deployment.
By the end of this training, participants will be able to:
- Install and configure RapidMiner
- Prepare and visualize data with RapidMiner
- Validate machine learning models
- Mashup data and create predictive models
- Operationalize predictive analytics within a business process
- Troubleshoot and optimize RapidMiner
Audience
- Data scientists
- Engineers
- Developers
Format of the Course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- To request a customized training for this course, please contact us to arrange.
Pattern Recognition
21 HoursThis instructor-led, live training in the UK (online or onsite) provides an introduction into the field of pattern recognition and machine learning. It touches on practical applications in statistics, computer science, signal processing, computer vision, data mining, and bioinformatics.
By the end of this training, participants will be able to:
- Apply core statistical methods to pattern recognition.
- Use key models like neural networks and kernel methods for data analysis.
- Implement advanced techniques for complex problem-solving.
- Improve prediction accuracy by combining different models.