Google Colab setup

Run the project on Google Colab with zero local installation required.

Advantages

No local installation needed
Free GPU access (Tesla T4)
Works on any device with a web browser
Automatic dependency management
Perfect for quick start

Limitations

Session timeout after inactivity
Need to re-run setup after disconnect
Limited to Colab’s GPU quota
Requires Google account

Setup steps

1. Open the Colab notebook

Go to Google Colab
Click File > Open notebook
Select the GitHub tab
Enter: gperdrizet/GANNs-with-friends
Select: notebooks/run_worker_colab.ipynb

2. Enable GPU runtime

Click Runtime > Change runtime type
Set Hardware accelerator to GPU
Select T4 (or best available)
Click Save

3. Run the setup cells

Execute the cells in order:

Cell 1: Clone repository

# Clone repository if not already present
import os
if not os.path.exists('GANNs-with-friends'):
    !git clone https://github.com/gperdrizet/GANNs-with-friends.git
    %cd GANNs-with-friends
else:
    %cd GANNs-with-friends
    !git pull

Cell 2: Install dependencies

!pip install -q -r requirements.txt

4. Configure database connection

Cell 3: Create config file

if not os.path.exists('config.yaml'):
    !cp config.yaml.template config.yaml
    print('Created config.yaml from template')
    print('Edit config.yaml with your database credentials before continuing')
else:
    print('config.yaml already exists')

Cell 4: Edit credentials

Click the folder icon in the left sidebar, find config.yaml, and edit:

database:
  host: YOUR_DATABASE_HOST      # From instructor
  port: 54321
  database: distributed_gan
  user: YOUR_USERNAME           # From instructor
  password: YOUR_PASSWORD       # From instructor

5. Start worker

Cell 5: Run worker

!python src/worker.py

On first run, the dataset will be automatically downloaded from Hugging Face (~1.4 GB).

You should see:

Initializing worker...
Dataset not found locally. Downloading from Hugging Face...
Download complete: data/celeba_torchvision/data/img_align_celeba.zip
Images will be loaded directly from zip (no extraction needed)
Loaded dataset with 202599 images
Worker abc123 initialized successfully!
Name: YourName
GPU: Tesla T4
Batch size: 64
Waiting for work units...

Keeping the worker running

Colab sessions timeout after inactivity. To maximize uptime:

Keep the browser tab active - Don’t close or switch away for long
Monitor periodically - Check every 30-60 minutes
Use Colab Pro (optional) - Longer runtimes and better GPUs
Re-run when disconnected - Just execute the cells again

Monitoring your contribution

The worker prints updates as it processes batches:

Processing work unit 42 (iteration 5)...
Completed work unit 42 in 12.3s
Processed 320 images total

Stopping the worker

To stop gracefully:

Click the Stop button in Colab
Or press Runtime > Interrupt execution

Tips for Colab users

Save your config:

Download config.yaml after editing
On next session, upload it instead of editing again

Monitor GPU usage:

!nvidia-smi

Check remaining quota:

Colab shows GPU usage at bottom-right
Free tier: ~12 hours/day
Colab Pro: longer sessions

Resume after disconnect:

Just re-run all cells
Worker will pick up where training left off
No data loss (all training state is in database)

Troubleshooting

GPU not available:

Verify runtime type is set to GPU
May need to wait if quota exceeded
Try again in a few hours

Dataset download fails:

Re-run the download cell
Check internet connection
Try clearing output and re-running

Can’t connect to database:

Verify credentials in config.yaml
Check database host is publicly accessible
Contact instructor for help

Session keeps disconnecting:

Normal for free tier with long idle periods
Keep browser tab active
Consider Colab Pro for longer sessions

Next steps

Student guide - Understanding your role as a worker
Monitoring - Track training progress
View results - See generated faces