Creating ML Pipelines on SparkΒΆ
To create a Machine Learning Pipeline for Spark, the developer does the following:
Divides the pipeline into a logical sequence of components
Pulls the generated code for the newly added components to his development machine from the code repository
The generated code will have pre-generated classes for each component, as well as stubs for some methods. The developer provides code for these methods (see here for details)