Change models of GPT using cli option or environment variable

minheibis commented 1 year ago

実は、新しいモデルと古いモデルでは、API のエンドポイントが異なるようです[1]。

First Header	Second Header	Second Header
Newer models (2023–)	gpt-4, gpt-3.5-turbo	https://api.openai.com/v1/chat/completions
Older models (2020–2022)	text-davinci-003, text-davinci-002, davinci, curie, babbage, ada	https://api.openai.com/v1/completions

そして、精度の面では、gpt-4, gpt-3.5-turboには差があるようですが、gpt-3.5-turboとtext-davinci-003の差はそこまでないかもしれません[2]。

[2]より抜粋 The difference between these APIs derives mainly from the underlying GPT models that are available in each. The chat completions API is the interface to our most capable model (gpt-4), and our most cost effective model (gpt-3.5-turbo). For reference, gpt-3.5-turbo performs at a similar capability level to text-davinci-003 but at 10% the price per token! See pricing details here.

入力のフォーマットも異なるようなので、gpt-3.5-turbo と text-davinci-003 でどの程度差があるかをまずは実験した上で、gpt-3.5-turbo で良さそうであれば、ステップ① 叩く API エンドポイントを変えて、入力のフォーマットも併せて変更し、ステップ② gpt-3.5-turbo と gpt-4 を環境変数で入力するような形（これはこのissueで想定されているように、モデル名を環境変数で指定する意味）にするのが良さそうだと思われます。

参考: [1] GPT - OpenAI API - https://platform.openai.com/docs/guides/gpt

[2] GPT - OpenAI API - https://platform.openai.com/docs/guides/gpt/chat-completions-vs-completions

ryo-ma commented 1 year ago

基本的に性能面を考えてgpt-3.5-turboで良いと考えております。実装した当時はtext-davinci-003が適切だったので。

なのでおっしゃる通り環境変数およびCLIで指定できるモデルとしてはgpt-3.5-turboかgpt-4かなと思っています。

minheibis commented 1 year ago

@ryo-ma こちらもまだ対応されてなければ、私の方で、今週来週くらいに作業してみようと思います。

c-3lab / dim

Change models of GPT using cli option or environment variable #106