๐Ÿ“ค From Scratch Implementations

Shared my work on implementations of various DL models in PyTorch. Open sourced the weights from the pre and post training stages

From Scratch Implementations

๐Ÿ—ƒ๏ธ Dataset Management

Datasets I created as part of personal project or internships

Browse Datasets

๐Ÿ“ค Custom Pretrained Model Weights

Open sources model weights from some of my pre and post training runs of various models from Llama, Mixtral to Llava and Whisper covering NLP, Vision and Audio.

Browse Models

๐Ÿ Python Client Library

A package to streamline training your models!.

Learn More

๐Ÿงช SmolHub Playground

Space to play with my custom trained models!

SmolHub Playground

About SmolHub

SmolHub is a website which showcases from scratch models in Pytorch in Multimodal (language + audio + vision) domain ranging from from-scratch models , pretraining + postraining on billions of tokens and datasets I create for fun!.

Key Features

๐Ÿš€

Easy Model Download

Access models with minimal friction through smolhub_downloads package

๐Ÿ”

Model Discovery

Custom models trained from scratch myself which you could play and experiment with!

๐Ÿ“Š

Dataset Management

Download datasets best for your project

๐Ÿ’ป

Web Interface

Intuitive browsing of models and datasets

๐Ÿ

Python Client Library

Custom made smolhub python package to streamline your training runs

๐Ÿ”’

Gradio Interface

Gradio interface right in the browser to streamline interactions with your models.

Getting Started

Using the Web Interface

  1. Browse available models and datasets
  2. Upload your models/datasets through the intuitive UI (Admin only (soon for every user!))
  3. Search and download models/datasets through our smolhub_download Python package!

Using the Python Client

Install the client library:

pip install smolhub_download

Basic usage:


 from smolhub_download.client import download_model, download_dataset

# Initialize the client
client = SmolHubClient()



# Download a Datasets
dataset_name = 'Luis Suarez Handball Stance Detection'
dataset_path = download_dataset(dataset_name, output_dir=output_directory)

# Test model download
model_name = 'SmolLlama-130M-Pretrained' 
model_path = download_model(model_name, output_dir=output_directory)

Requirements

  • Python >= 3.7
  • PyTorch >= 1.7.0
  • Modern web browser for the web interface
  • CUDA-capable GPU (optional, for GPU model support)