With the massive amounts of data collected and stored every day, the need to make sense of this data has become essential in many academic and industrial domains. Machine learning is an interdisciplinary field focused on analyzing large amounts of data to discover patterns, and is an integral part of the data-driven decision-making process.
This two-day workshop at the San Diego Supercomputer Center, followed by an online project, introduces students to the tools and techniques used to explore, analyze, and leverage data to construct machine learning solutions to scientific and business problems. Topics include the machine learning process, methods for data exploration and preparation, fundamental machine learning algorithms, and model evaluation approaches. Hands-on exercises will allow students to apply their new skills and work with data analysis tools.
This is a Modern Data Science Academy course. The Modern Data Science Academy provides workshops on current data science topics taught by leading SDSC researchers and practitioners.
- Machine learning process
- Machine learning tools
- Data exploration
- Data preparation
- Cluster analysis
- Associative analysis
- Model evaluation
- Hands-on exercises using machine learning tools
- Project to apply techniques and skills to real-world data
Software: Students must bring a laptop with RapidMiner Studio 7.3, Community Edition installed on it to the workshop. There is no cost for this program.
Prerequisites: Working knowledge of statistics and any programming language recommended.