Model training job involves using datasets to create and optimize machine learning models. This process occurs in the cloud environment, where data scientists run code to build models and tune hyperparameters. Training uses computational resources like GPUs or TPUs to accelerate the learning process and improve model accuracy.
Log in to your Greennode AI Platform account and navigate to the Model Training Dashboard at: https://aiplatform.console.greennode.ai/training.
Find and click on the "Create a training job" button.
1. Basic Configuration
Provide a suitable name for your training job, e.g., "MyAITrainingJob."
Choosing the location for this Cloud Training Job.
Instance count: By default, the number of instances to run the training job is 1, in a single training mode.
3. Training Container
Choose a training framework & suitable version that meets your requirements. Select Pre-built container option to use as a supported framework, then specify the model framework & its version.
Training command & arguments
Command: Input the command containing the location where the file code located to be executed in the training job. Ensure it is accessible and imported correctly.
Arguments: Set up any necessary arguments for the training job, such as hyperparameters, dataset locations, model configuration, etc. Pass these arguments to the argument field.
4. Data Mount (optional)
Click the "Start training" button to run your training model with the specified configurations at the bottom right corner.
Once start running your model, access your training process with “Running” status from the Model Training Dashboard.
Navigate to the Monitoring section to view logs generated during the training process, such as loss values, accuracy scores, and other performance indicators.