openppl-public / ppl.llm.serving

Apache License 2.0
122 stars 13 forks source link

PPL LLM Serving

Overview

ppl.llm.serving is a part of PPL.LLM system.

SYSTEM_OVERVIEW

We recommend users who are new to this project to read the Overview of system.

ppl.llm.serving is a serving based on ppl.nn for various Large Language Models(LLMs). This repository contains a server based on gRPC and inference support for LLaMA.

Prerequisites

Quick Start

Here is a brief tutorial, refer to LLaMA Guide for more details.

License

This project is distributed under the Apache License, Version 2.0.