jan-janssen / LangSim

Application of Large Language Models (LLM) for computational materials science - visit http://jan-janssen.com/LangSim/
BSD 3-Clause "New" or "Revised" License

consider switching basic ipython magic to `jupyter-ai` #30

Open ltalirz opened 4 months ago

ltalirz commented 4 months ago

Motivation

Enabling experimentalists to run simulations is a great long-term goal, but requires substantial work in vetting workflows, documenting them, and then actual testing with a broad audience of experimentalists in order to weed out edge cases and earn trust.

A much lower-hanging fruit is to make computational scientists more productive (and dogfooding always makes a product better). For this use case, hardcoded agents are typically too restrictive - instead, we want langsim to be a Copilot that helps us write Python code for running simulations.

This is possible via IPython magics (a basic implementation for an immediately executable "code response" is here), but once you get into the details (e.g. streaming responses rather than waiting for a lengthy code completion to finish), it starts to get tricky.
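For illustration, here is a minimal sketch of such a magic, assuming a langchain chat model (the magic name, model choice, and prompt wording are placeholders); IPython's `set_next_input` pre-fills the next cell with the generated code instead of executing it:

```python
from IPython import get_ipython
from IPython.core.magic import register_line_magic
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o")  # model choice is an assumption


@register_line_magic
def gen(line):
    """Ask the LLM for code and place it in a new cell below for review."""
    response = llm.invoke(f"Write Python code to {line}. Return only the code.")
    get_ipython().set_next_input(response.content, replace=False)
```

Running e.g. `%gen relax bulk Al with ASE's EMT calculator` would then drop the generated code into the next cell for review rather than running it.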

jupyter-ai follows the same route, but adds extra features, including streaming responses as well as other nifty UI integrations (e.g. a copilot in the sidebar).

Downsides

Steps

As far as I understand, in order to connect our setup to jupyter-ai, we need to create a "Jupyter AI module", for which they offer a cookiecutter.

Jupyter AI already uses langchain, which should help with the integration, but when I briefly looked into this during the hackathon (see code), the integration point was at the level of `langchain_core.language_models.llms.LLM` rather than at the level of the agents/agent executors that we use in langsim.
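For reference, my understanding from the jupyter-ai docs is that a custom provider subclasses both `BaseProvider` and a langchain `LLM` class, roughly like this (names are placeholders; `FakeListLLM` just stands in for a real model):

```python
from jupyter_ai_magics import BaseProvider
from langchain_community.llms import FakeListLLM


class LangSimProvider(BaseProvider, FakeListLLM):
    """Placeholder provider; a real one would wrap our model, not FakeListLLM."""

    id = "langsim"
    name = "LangSim"
    model_id_key = "model"
    models = ["langsim-default"]

    def __init__(self, **kwargs):
        # FakeListLLM cycles through canned responses; stand-in only
        kwargs["responses"] = ["canned response for testing"]
        super().__init__(**kwargs)
```

The provider is then registered via a `jupyter_ai.model_providers` entry point.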

I was not able to quickly determine whether this poses a problem; perhaps @chiang-yuan can give some pointers on whether establishing this link is straightforward or whether coupling agents to jupyter-ai is difficult with the current implementation.

chiang-yuan commented 4 months ago

I am not familiar with IPython magics, but if we want code generation and execution, it is already possible with `PythonREPLTool` (`from langchain_experimental.tools import PythonREPLTool`).

I have an example in LLaMP that generates and runs an ASE EMT relaxation, as implemented here.
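For anyone following along, a minimal sketch of such an agent (the model name, prompt wording, and example task are assumptions, not the LLaMP implementation):

```python
from langchain.agents import AgentExecutor, create_tool_calling_agent
from langchain_core.prompts import ChatPromptTemplate
from langchain_experimental.tools import PythonREPLTool
from langchain_openai import ChatOpenAI

# an agent that can write Python code and execute it in a REPL
tools = [PythonREPLTool()]
prompt = ChatPromptTemplate.from_messages(
    [
        ("system", "You write and run Python code to answer simulation questions."),
        ("human", "{input}"),
        ("placeholder", "{agent_scratchpad}"),
    ]
)
llm = ChatOpenAI(model="gpt-4o")
agent = create_tool_calling_agent(llm, tools, prompt)
executor = AgentExecutor(agent=agent, tools=tools, verbose=True)

executor.invoke({"input": "Relax bulk Al with ASE's EMT calculator and report the energy."})
```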

Regarding streaming, it can be turned off if code generation speed is a concern (although I don't think there would be much difference).
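In langchain terms this is just the difference between `invoke` and `stream` on the model (sketch; model name is an assumption):

```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o")

# blocking: wait for the full completion before showing anything
print(llm.invoke("Write ASE code for an EMT relaxation.").content)

# streaming: print tokens as they arrive
for chunk in llm.stream("Write ASE code for an EMT relaxation."):
    print(chunk.content, end="", flush=True)
```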

ltalirz commented 4 months ago

Thanks Yuan, I see that this REPL tool can be useful as well.

I had a slightly different use case in mind, though - instead of an agent that executes the code directly, I'd like the language model to simply generate code for me in the next notebook cell so that I can review and edit it before running it myself. Alternatively, I want tab-autocomplete (which jupyter-ai also supports).

This will not use the calculation agents (or the REPL), but it might in principle use other agents, e.g. to read package documentation.

In most cases, however, the added value over vanilla ChatGPT/GitHub Copilot comes from the engineered system prompt: it knows which simulation packages are installed in my environment and, besides info on how to use the langchain tool functions, could also include information on how to interact with the Python API of the underlying simulation software, how best to run certain calculations, etc.
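As a sketch of what I mean, the system prompt could be assembled from the environment at startup (the package list and wording are just examples):

```python
import importlib.metadata

# candidate simulation packages to advertise to the model (example list)
SIM_PACKAGES = ["ase", "pymatgen", "lammps", "gpaw"]

installed = []
for pkg in SIM_PACKAGES:
    try:
        installed.append(f"{pkg}=={importlib.metadata.version(pkg)}")
    except importlib.metadata.PackageNotFoundError:
        pass  # skip packages not present in this environment

system_prompt = (
    "You help write Python code for atomistic simulations.\n"
    f"Simulation packages installed in this environment: {', '.join(installed)}\n"
    "Prefer these packages and their documented APIs when generating code."
)
```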

This use case is slightly different from the one in the demo video, but I think it has great short-term potential (I would use such a tool).

chiang-yuan commented 4 months ago

That is a very interesting idea! I am not sure if someone has implemented this before, but imo it is possible to:

  1. Load the Python package source with `DirectoryLoader` or `PythonLoader`
  2. Build a local documentation index and save it in a vector store (steps 1-2 are sketched below)
  3. Hook into jupyter-ai so the vector store is queried whenever code completion is called
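A minimal sketch of steps 1-2 with langchain (the path, chunk sizes, and the embedding/vector-store choices are assumptions; any langchain-compatible alternatives would work):

```python
from langchain_community.document_loaders import DirectoryLoader, PythonLoader
from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings
from langchain_text_splitters import Language, RecursiveCharacterTextSplitter

# step 1: load the package source (path is a placeholder)
loader = DirectoryLoader("path/to/ase", glob="**/*.py", loader_cls=PythonLoader)
docs = loader.load()

# step 2: split with a Python-aware splitter and index into a local vector store
splitter = RecursiveCharacterTextSplitter.from_language(
    language=Language.PYTHON, chunk_size=1000, chunk_overlap=100
)
chunks = splitter.split_documents(docs)
vectorstore = FAISS.from_documents(chunks, OpenAIEmbeddings())
vectorstore.save_local("ase_source_index")

# at completion time: retrieve relevant snippets to inject into the prompt
hits = vectorstore.similarity_search("EMT relaxation of bulk Al", k=4)
```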

I'd be interested in developing this :)