Cataloguing LLMs: Your Ultimate Market Guide

Team Introduction

Big Byte Theory

Amy Leung: Software Engineer, Agile Project Management
Kemi Alake: Data Analyst, Data Processing and Analysis
Josh Ashton: Data Engineer, Data Integration and Management
Chris De Wardt: Data Engineer, Data Infrastructure and Pattern Matching

The Challenge

In regulated industries such as banking and pharmaceuticals, identifying suitable Large Language Models (LLMs) is a significant challenge. Businesses often struggle with determining which LLMs to consider and which to avoid. This issue is particularly pressing for market researchers, AI teams, and governance teams (including risk, tech, legal, and compliance) who need reliable information to make informed decisions. HorizonX Consultancy was looking for a team to create a centralized repository of LLM information, cataloging over 400 models currently on the market, to help provide crucial insights such as release dates, creators, number of parameters, training data, and any associated lawsuits.

Similar to how Crunchbase leads in private company data, Lighthouse aims to be the largest and most comprehensive LLM database, akin to the Stanford Database, offering invaluable data and analysis to guide users in selecting the most appropriate LLMs for their needs.

Key Features

Comprehensive List of LLMs: Detailed information on each model.
User-Friendly Interface: Easy navigation and search.
Informative Matrix: Comparing popular LLMs.

Evaluation Methodology

Criteria Selection: Accuracy (Exact match, F1 score, BLEU-4), Benchmark Tests (Commonsense, GSM, etc.), Popularity.
Data Source/Processing: Scores from Stanford HELM, integrated with MongoDB.
Weighting Criteria: Business Readiness (40% accuracy, 60% benchmark), Perceived Business Value (capability scores, model popularity).

Commercialization Strategy

Affiliate Marketing Model: Commission from LLM firms for leads.
Strengths: Low startup costs, passive income.
Weaknesses: Tracking and attribution, competition.

Market Expansion and Roadmap

Future Growth Areas: Partner with firms for LLM deployment, develop a marketplace for engineers/consultants.
Continuous Improvement: Integrate additional benchmarks, develop robust admin features, automate ETL pipeline updates.

Tech Stack

MERN Stack: Built with MongoDB, Express.js, React and Node.js.
Styling: Tailwind CSS

Reflection

Improvements: Better modularization, enhanced communication, standardized naming conventions, dynamic data integration.

Sources & Research

Stanford Database: HELM framework for LLM benchmarking.
Ecosystem-Graphs: General data on LLM models/applications/datasets.

License

This project is licensed under the MIT License.

Setup

git clone the repo. cd frontend/ - for client-side of project cd backend/ - for backend side of project npm start - project start locally with locally hosted database npm run prod - run project in production linked to cloud database npm build - build dist files for the project (frontend)

Live View: https://lighthouse-29v4.onrender.com/

big-byte-theory / Lighthouse

readme