Preparing a dataset for Machine Learning in Power BI

We explore a famous machine learning dataset, the Breast Cancer Wisconsin (Diagnostic) data, to see whether it is possible to build a model with some predictive power. We visualise the data as a histogram, a box-and-whisker plot and a ‘home-made’ swarm plot. To do this, we need to shape the dataset in the Query Editor and standardise the data with a few DAX calculations.

This is the first of several videos exploring the AI features in Power BI. In future sessions, we’ll use the Key Influencers and Decomposition Tree visuals to gain insights, incorporate R and Python into our analysis, build, evaluate and run a predictive model using AutoML, and look at Power BI’s natural language capabilities.

Links
The datasets and course materials are at https://bit.ly/33zWQtp in the Wisconsin folder
Dataset on UCI: https://archive.ics.uci.edu/ml/datase...
Dataset on Kaggle: https://www.kaggle.com/uciml/breast-c...

We run a full range of data analysis and generative AI courses in London and online. See our courses at zomalex.co.uk

Credits: UCI Machine Learning Repository
Music: www.bensound.com