jan-janssen / LangSim

Application of Large Language Models (LLM) for computational materials science
https://mybinder.org/v2/gh/jan-janssen/LangSim/HEAD?labpath=notebooks/demonstration.ipynb
BSD 3-Clause "New" or "Revised" License

Challenges of Large Language Models #48

Open jan-janssen opened 1 week ago

jan-janssen commented 1 week ago

Open Source

Unfortunately, most Llama-based and other free models fail to work with the tools defined by LangChain. They work for single functions, but they already struggle with the current complexity of LangSim.

ChatGPT

The behaviour seems to be somewhat reproducible so I wanted to quickly summarise it here.

jan-janssen commented 4 days ago

Different large language models can be tested by setting environment variables:

Anthropic:

import os
os.environ["LANGSIM_PROVIDER"] = "anthropic"
os.environ["LANGSIM_API_KEY"] = os.environ['ANTHROPIC_API_KEY']
os.environ["LANGSIM_MODEL"] = "claude-3-5-sonnet-20240620"

OpenAI:

import os
os.environ["LANGSIM_API_KEY"] = os.environ['OPENAI_API_KEY']
os.environ["LANGSIM_MODEL"] = "gpt-4o"

KISSKI:

import os
os.environ["LANGSIM_API_KEY"] = os.environ['KISSKI_API']
os.environ["LANGSIM_API_URL"] = "https://chat-ai.academiccloud.de/v1"
os.environ["LANGSIM_MODEL"] = "meta-llama-3-8b-instruct"
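When switching between providers in the same session, variables from a previous setup (for example `LANGSIM_API_URL` from KISSKI) can linger. A minimal sketch of a helper that wraps the three setups above could look like this. The names `PROVIDER_SETTINGS` and `configure_langsim` are hypothetical and not part of LangSim; only the environment variable names and values are taken from the snippets above, and clearing the `LANGSIM_*` variables first is an assumption about what is desirable, not documented LangSim behaviour.

```python
import os

# Values copied from the snippets above. The dictionary layout and the
# helper below are illustrative assumptions, not part of LangSim.
PROVIDER_SETTINGS = {
    "anthropic": {
        "key_variable": "ANTHROPIC_API_KEY",
        "settings": {
            "LANGSIM_PROVIDER": "anthropic",
            "LANGSIM_MODEL": "claude-3-5-sonnet-20240620",
        },
    },
    "openai": {
        "key_variable": "OPENAI_API_KEY",
        "settings": {"LANGSIM_MODEL": "gpt-4o"},
    },
    "kisski": {
        "key_variable": "KISSKI_API",
        "settings": {
            "LANGSIM_API_URL": "https://chat-ai.academiccloud.de/v1",
            "LANGSIM_MODEL": "meta-llama-3-8b-instruct",
        },
    },
}

LANGSIM_VARIABLES = (
    "LANGSIM_PROVIDER",
    "LANGSIM_API_KEY",
    "LANGSIM_API_URL",
    "LANGSIM_MODEL",
)


def configure_langsim(name, environ=os.environ):
    """Set the LANGSIM_* variables for one of the setups listed above."""
    entry = PROVIDER_SETTINGS[name]
    key_variable = entry["key_variable"]
    if key_variable not in environ:
        raise KeyError(f"Please set {key_variable} before selecting '{name}'")
    # Remove settings left over from a previously selected provider.
    for variable in LANGSIM_VARIABLES:
        environ.pop(variable, None)
    environ["LANGSIM_API_KEY"] = environ[key_variable]
    environ.update(entry["settings"])
    # Return only the variable names, so the API key is never echoed.
    return sorted(v for v in LANGSIM_VARIABLES if v in environ)
```

With `KISSKI_API` exported, calling `configure_langsim("kisski")` sets the same three variables as the KISSKI snippet; a later `configure_langsim("openai")` removes `LANGSIM_API_URL` again before applying the OpenAI settings.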