HSV-AI / presentations

This repository is used to manage the presentations given at Huntsville AI meetups. It provides a collection of Issues, Cards, and Files to plan and create the content needed for a presentation.
17 stars 6 forks source link

10/25/2023 - Mixture of Experts: Harnessing the Hidden Architecture of GPT4 #88

Closed fearnworks closed 1 year ago

fearnworks commented 1 year ago

Description

Hello everyone!

We're back at HudsonAlpha this week to talk about Mixture of Experts. Josh Phillips has been working with MoE across several LLM based projects over the last few months. He will give us an overview of MoE and the current progress on this pattern among the open source community.

In today's AI landscape, we often face a trade-off between the capabilities of dense language models and the computational cost associated with running them. The Mixture of Experts (MoE) architecture offers an intriguing solution to this dilemma. By replacing standard linear layers with a Gating Mechanism that routes tasks to specialized 'experts,' MoE models manage to keep computational costs stable while expanding the model's memory footprint. This approach allows for high performance without the usual spike in inference time. This however comes at the cost of increased complexity during training and inference.

The Mixture of Experts architecture is widely known to be the "secret sauce" behind the GPT4 model currently available from OpenAI, although concrete details of their implementation of the architecture are not currently known.

In this talk we will explore the building blocks of the mixture of experts architecture and how it compares to traditional transformers. We will also explore the current paradigms for implementing mixture of experts, example projects, and the challenges being faced by each.

Links:

Papers

Repos / Code

Misc :

Details:

Complete the following items to get a presentation ready for Huntsville AI

Adding material to the presentations repository

Add the file to present (prefer Jupyter Notebooks or Markdown formated files) to the folder structure. For multiple files, create a directory following the naming convention and add the files to it.

Naming convention

We use a convention of starting the filenames with a date (year/month/day) so that the files are still sorted by date even when in alphabetical format.

YYMMDD_Session_Description.extension

fearnworks commented 1 year ago

image