Data must be prepared before machine learning algorithms can be applied. Furthermore, applying an algorithm follows a specific cycle, which we will review here. The MLBase package will be used in this section.
We will highlight some data-preprocessing techniques
We will walk through a typical algorithm application cycle
We will discuss some techniques to validate how well a model performs