southern-cross-ai / BabyJoey

Small 115 million parameter model - .5GB
Apache License 2.0
4 stars 9 forks source link

🌟 BabyJoey 🌟

A Compact 115 Million Parameter Model - 0.5GB

Welcome to BabyJoey, a streamlined Australian language model designed for efficient performance. This document provides a detailed overview of the project structure to help you get started quickly.


πŸ“‚ Project Root File Structure

πŸ“ README.md

Comprehensive documentation for the project, explaining the purpose, setup instructions, and usage of BabyJoey.


πŸ“ README.md

Purpose

BabyJoey is a lightweight language model inspired by GPT-1, featuring 115 million parameters and designed for tasks that require a balance of efficiency and performance. It's perfect for educational purposes, research, and small-scale applications.

Setup Instructions

  1. Clone the Repository:
    git clone https://github.com/yourusername/babyjoey.git
    cd babyjoey
  2. Install Dependencies:
    pip install -r requirements.txt
  3. Configure the Model: Modify config.py to set your desired hyperparameters and configurations.
  4. Prepare Data: Ensure your datasets are in place and correctly referenced in data/dataloader.py.

Usage


πŸš€ main.py

The entry point of BabyJoey. This script parses arguments, sets up configurations, and invokes the training loop.

πŸ“‹ requirements.txt

Lists the dependencies required to run BabyJoey, which can be installed using pip.


πŸ“ Directories

🧠 model/

πŸ“Š data/

πŸ‹οΈ training/

πŸ› οΈ utils/

πŸ“œ scripts/


πŸ“‚ Additional Directories

πŸ“‘ logs/

Stores log files generated during training and evaluation for debugging and monitoring purposes.

πŸ’Ύ checkpoints/

Stores model checkpoints saved during training to allow for resuming or fine-tuning.


We hope you enjoy working with BabyJoey! If you have any questions, feel free to reach out to our support team or check the detailed documentation in the README.md. Happy coding! πŸš€