The Hark Blog - Archive

Splitting data

Splitting data into ‘train’, ‘validation’ and ‘test’ sets

When developing and deploying machine learning models, it’s important that we split the dataset into ‘train’, ‘validation’, and ‘test’ datasets. This protects against an overfitted model, and helps ensure results are generalised. In this blog post we will look in to how to split the data, and why.

Read More

Subscribe to Our Newsletter

Stay up to date with the latest industry news, platform developments and more.