To efficiently train our model on larger datasets and scale it across multiple GPUs, we will use PyTorch Lightning, a high-level framework built on top of PyTorch. PyTorch Lightning simplifies distributed and multi-GPU training, making it easier to implement, scale, and manage training workloads without dealing with the underlying complexity yourself.
Install the required tools and structure the package:

- In the `devtools_scicomp` environment, install PyTorch Lightning and add it to the `requirements.txt` file. Do the same for `tensorboard` and `torchvision`.
- In the `devtools_scicomp_project_2025` repository, create a new branch starting from the `main` one, called `deep_classifier`.
- Inside `src/pyclassify/`, create `model.py`, `module.py`, and `datamodule.py`.

Inside `src/pyclassify/model.py`, create the AlexNet. You might want to use `nn.Conv2d`, `nn.ReLU`, `nn.MaxPool2d`, and `nn.Dropout` in your implementation:
```python
import torch.nn as nn


class AlexNet(nn.Module):
    def __init__(self, num_classes):
        super().__init__()
        self.num_classes = num_classes
        self.features = nn.Sequential(
            # here insert convolutional blocks
        )
        self.avgpool = nn.AdaptiveAvgPool2d((6, 6))
        self.classifier = nn.Sequential(
            # here insert the linear + dropout blocks
        )

    def forward(self, x):
        x = self.avgpool(self.features(x)).flatten(start_dim=1)
        logits = self.classifier(x)
        return logits
```
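If you want a starting point, here is a sketch of what could go inside the two containers, following the layer sizes of the classic (torchvision) AlexNet; other choices are equally valid:

```python
# Possible contents of the two nn.Sequential containers (torchvision AlexNet sizes):
self.features = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=11, stride=4, padding=2), nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(64, 192, kernel_size=5, padding=2), nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(192, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(256, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),
)
self.classifier = nn.Sequential(
    nn.Dropout(p=0.5),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(inplace=True),
    nn.Dropout(p=0.5),
    nn.Linear(4096, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, num_classes),
)
```

Note that the `nn.AdaptiveAvgPool2d((6, 6))` in the skeleton always produces a `256 * 6 * 6` feature vector, so the classifier sizes above work for the 64x64 crops produced by the data module below.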
Inside `src/pyclassify/module.py`, create the `Classifier` module using PyTorch Lightning.

In the `__init__` method, instantiate the `AlexNet` model and create the class attribute `train_accuracy` by:

```python
self.train_accuracy = torchmetrics.classification.Accuracy(task="multiclass", num_classes=???)
```

choosing the right number of classes. Do the same for `val_accuracy` and `test_accuracy`.
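Put together, a minimal sketch of the `__init__` (assuming CIFAR-10, hence 10 classes) could look like this:

```python
import lightning.pytorch as pl
import torch
import torchmetrics

from pyclassify.model import AlexNet


class Classifier(pl.LightningModule):
    def __init__(self, num_classes=10):
        super().__init__()
        self.model = AlexNet(num_classes=num_classes)
        # one metric object per stage, so their internal states stay separate
        self.train_accuracy = torchmetrics.classification.Accuracy(
            task="multiclass", num_classes=num_classes)
        self.val_accuracy = torchmetrics.classification.Accuracy(
            task="multiclass", num_classes=num_classes)
        self.test_accuracy = torchmetrics.classification.Accuracy(
            task="multiclass", num_classes=num_classes)
```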
Write a private method called `_classifier_step` which takes the batch as input and: (1) extracts `features` and `true_labels` from it, (2) computes the logits by performing a forward pass (just call `self(features)`), (3) computes the cross-entropy loss, (4) returns the predicted labels (the most probable class), the true labels, and the loss.
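A sketch of `_classifier_step`, assuming each batch is a `(features, true_labels)` tuple as produced by the CIFAR-10 dataloaders below:

```python
import torch.nn.functional as F  # at the top of module.py

def _classifier_step(self, batch):
    features, true_labels = batch                    # (1) unpack the batch
    logits = self(features)                          # (2) forward pass
    loss = F.cross_entropy(logits, true_labels)      # (3) cross-entropy on the logits
    predicted_labels = torch.argmax(logits, dim=1)   # (4) most probable class
    return predicted_labels, true_labels, loss
```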
Set up the `training_step`, `validation_step`, and `test_step`. Each should call `_classifier_step` on the given batch and log the accuracy; the `training_step` must also return the loss (otherwise the module does not know what to optimize). You can log with:

```python
self.log('train_accuracy', self.train_accuracy, on_step=True, on_epoch=False)
```

For more on logging, see the PyTorch Lightning documentation.
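Under the same assumptions, the three steps might look like this:

```python
def training_step(self, batch, batch_idx):
    predicted_labels, true_labels, loss = self._classifier_step(batch)
    self.train_accuracy(predicted_labels, true_labels)
    self.log("train_accuracy", self.train_accuracy, on_step=True, on_epoch=False)
    return loss  # returned so that Lightning knows what to optimize

def validation_step(self, batch, batch_idx):
    predicted_labels, true_labels, loss = self._classifier_step(batch)
    self.val_accuracy(predicted_labels, true_labels)
    self.log("val_accuracy", self.val_accuracy, on_step=False, on_epoch=True)

def test_step(self, batch, batch_idx):
    predicted_labels, true_labels, loss = self._classifier_step(batch)
    self.test_accuracy(predicted_labels, true_labels)
    self.log("test_accuracy", self.test_accuracy, on_step=False, on_epoch=True)
```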
Finally, add the `forward` method and the `configure_optimizers` method, which tells Lightning which optimizer to use:

```python
def forward(self, x):
    return self.model(x)

def configure_optimizers(self):
    optimizer = torch.optim.Adam(self.parameters(), lr=0.0001)
    return optimizer
```
Inside `src/pyclassify/datamodule.py`, create the Lightning DataModule. Note that the transforms are defined in `__init__` rather than in `prepare_data`: `prepare_data` is called on a single process only, so attributes set there would not exist in the other processes during distributed training.

```python
import lightning.pytorch as pl
from torch.utils.data import random_split
from torchvision import datasets, transforms


class CIFAR10DataModule(pl.LightningDataModule):
    def __init__(self, data_path=WHERE_TO_SAVE, batch_size=64):
        super().__init__()
        self.data_path = data_path
        self.batch_size = batch_size
        # Define the transforms here (not in prepare_data, which runs on one process only).
        self.transform = transforms.Compose(
            [transforms.Resize((70, 70)), transforms.RandomCrop((64, 64)),
             transforms.ToTensor()])

    def prepare_data(self):
        # Download once; setup() then loads the data with download=False.
        datasets.CIFAR10(root=self.data_path, download=True)

    def setup(self, stage=None):
        train = datasets.CIFAR10(
            root=self.data_path,
            train=True,
            transform=self.transform,
            download=False,
        )
        self.train, self.valid = random_split(train, lengths=[45000, 5000])
        self.test = datasets.CIFAR10(
            root=self.data_path,
            train=False,
            transform=self.transform,
            download=False,
        )

    def train_dataloader(self):
        # build train dataloader

    def val_dataloader(self):
        # build val dataloader

    def test_dataloader(self):
        # build test dataloader
```
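A sketch of the three dataloader methods (`num_workers=4` is an arbitrary choice; tune it for your machine):

```python
from torch.utils.data import DataLoader  # at the top of datamodule.py

def train_dataloader(self):
    return DataLoader(self.train, batch_size=self.batch_size,
                      shuffle=True, num_workers=4, drop_last=True)

def val_dataloader(self):
    return DataLoader(self.valid, batch_size=self.batch_size,
                      shuffle=False, num_workers=4)

def test_dataloader(self):
    return DataLoader(self.test, batch_size=self.batch_size,
                      shuffle=False, num_workers=4)
```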
Implementing a command line interface (CLI) makes it possible to execute an experiment from a shell terminal. By having a CLI, there is a clear separation between the Python source code and what hyperparameters are used for a particular experiment. If the CLI corresponds to a stable version of the code, reproducing an experiment can be achieved by installing the same version of the code plus dependencies and running with the same configuration.
Lightning projects usually begin with one model and one dataset. As the project grows in complexity and you introduce more models and more datasets, it becomes desirable to mix any model with any dataset directly from the command line without changing your code.
```bash
# Mix and match anything
$ python main.py fit --model=GAN --data=MNIST
$ python main.py fit --model=Transformer --data=MNIST
```
`LightningCLI` makes this very simple. Otherwise, this kind of configuration requires a significant amount of boilerplate that often looks like this:
```python
# choose model
if args.model == "gan":
    model = GAN(args.feat_dim)
elif args.model == "transformer":
    model = Transformer(args.feat_dim)
...

# choose datamodule
if args.data == "MNIST":
    datamodule = MNIST()
elif args.data == "imagenet":
    datamodule = Imagenet()
...

# mix them!
trainer.fit(model, datamodule)
```
It is highly recommended that you avoid writing this kind of boilerplate and use `LightningCLI` instead. Let's build one! Inside `scripts/run.py`, set up the CLI. This is very easy; just write the following:
```python
from lightning.pytorch.cli import LightningCLI

import pyclassify.model
import pyclassify.module
import pyclassify.datamodule

cli = LightningCLI(subclass_mode_data=True, subclass_mode_model=True)
```
Then generate a template configuration file with:

```bash
python scripts/run.py fit --print_config > experiments/config.yaml
```
Update the config file to handle the module kwargs, and run the CLI on CPU for 2 epochs using:

```bash
python scripts/run.py fit --config=experiments/config.yaml
```
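With `subclass_mode_model=True` and `subclass_mode_data=True`, the model and data sections of the generated config take a `class_path`/`init_args` form. The relevant parts of `experiments/config.yaml` might look roughly like this (the exact fields depend on your `__init__` signatures; the values shown are assumptions):

```yaml
trainer:
  accelerator: cpu
  max_epochs: 2
model:
  class_path: pyclassify.module.Classifier
  init_args:
    num_classes: 10
data:
  class_path: pyclassify.datamodule.CIFAR10DataModule
  init_args:
    data_path: ./data
    batch_size: 64
```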
If everything works fine, push your changes to `origin/deep_classifier`.
Distributed Data Parallel (DDP) training works as follows:

- Each GPU across each node gets its own process.
- Each GPU gets visibility into a subset of the overall dataset. It will only ever see that subset.
- Each process initializes the model.
- Each process performs a full forward and backward pass in parallel.
- The gradients are synced and averaged across all processes.
Here is an example of how it should be used:

```python
# train on 8 GPUs (same machine, i.e. one node)
trainer = Trainer(accelerator="gpu", devices=8, strategy="ddp")

# train on 32 GPUs (4 nodes)
trainer = Trainer(accelerator="gpu", devices=8, strategy="ddp", num_nodes=4)
```
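When using `LightningCLI`, the same choice lives under the `trainer` section of the config file; for the run below (1 node, 2 GPUs) it might look like this:

```yaml
trainer:
  accelerator: gpu
  devices: 2
  num_nodes: 1
  strategy: ddp
```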
Now, for the last task, go to the Ulysses cluster, clone the repository, and check out the `deep_classifier` branch. Install all the requirements and the package. Update the config file to train on 1 node with 2 GPUs, and run the application on Ulysses. In order to run, build a `submit.sbatch` file inside `shell/`. You can use the following:
```bash
#!/bin/bash

# SLURM job options
#SBATCH --partition=gpu2
#SBATCH --job-name=YOUR NAME here
#SBATCH --nodes=HOW MANY NODES?
#SBATCH --gpus=HOW MANY GPUS?
#SBATCH --ntasks-per-node=HOW MANY TASKS TO RUN ON EACH NODE?
#SBATCH --gpus-per-task=HOW MANY GPUS FOR EACH TASK?
#SBATCH --mem=YOUR MEMORY
#SBATCH --time=00:10:00
#SBATCH --output=%x.o%j.%N
#SBATCH --error=%x.e%j.%N

# Print job details
NOW=`date +%H:%M-%a-%d/%b/%Y`
echo '------------------------------------------------------'
echo 'This job is allocated on '$SLURM_JOB_CPUS_PER_NODE' cpu(s)'
echo 'Job is running on node(s): '
echo $SLURM_JOB_NODELIST
echo '------------------------------------------------------'
#
# ==== End of Info part (say things) ===== #
#
cd $SLURM_SUBMIT_DIR               # go into the submission directory
export SLURM_NTASKS_PER_NODE=2     # not needed on every cluster, but Ulysses has a bug :/
module load cuda/12.1              # load CUDA
conda activate devtools_scicomp    # activate the environment

# Run the script
srun python ....
```
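Once the placeholders are filled in, the job can be submitted from the repository root with `sbatch shell/submit.sbatch`; with the `--output`/`--error` patterns above, the log files will appear in the directory you submitted from.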
The repository with the right structure and commits is available here: GitHub repo