daveshap commented 3 days ago

Establish XML Standard for Chain of Thought within JSONL for LLM Training

Objective

Create an XML standard for structuring Chain of Thought (CoT) data within JSONL files for our open-source AI finetuning dataset project. This standard will help models track their thought process and improve UX by providing a clear structure for output.

Background

We're using the industry-standard JSONL format for LLM finetuning
The XML structure within the JSONL is our unique approach, inspired by (but distinct from) other corporate methodologies
This standard is for internal use in our open-source project

Requirements

JSONL Structure

Use JSON Lines (JSONL) format
Each line contains a single key, "messages," followed by a list of chat message dictionaries
Each dictionary should have three keys: "system," "user," and "assistant"

XML Standard (within "assistant" key)

Implement XML tags to structure the Chain of Thought process
Initial set of tags to include (but not limited to):
- <planning>
- <thinking>
- <output>
- <attempt>
- <reflection>
Establish <output> as the final tag in the sequence
Allow flexibility for additional tags as needed
Ensure proper XML escaping within the JSONL format

Considerations

The XML structure should be simple and easy to understand
XML tags should be clearly distinguishable from JSON structure
Consider how this format will be rendered in UX
Ensure compatibility with LLM processing of XML structures

Tasks

[ ] Define the complete set of initial XML tags
[ ] Establish rules for tag order and nesting (if any)
[ ] Create examples of properly formatted JSONL files with embedded XML
[ ] Document the standard, including purpose and usage of each tag
[ ] Implement a validation script to ensure adherence to the standard
[ ] Create guidelines for adding or modifying tags in the future

Questions to Address

Are there any specific Chain of Thought patterns we should account for in our tag structure?
Should we implement any attributes within the XML tags for additional metadata?
Do we need to establish any length or content restrictions for the text within each tag?

Please review and provide feedback on this proposed structure. We can iterate on this to ensure it covers all necessary aspects of the standard.

daveshap commented 3 days ago

{"messages": [
    {"system": "???", 
    "user": "write a 10 word coherent sentence where each word has exactly 1 letter more than its previous word starting with first word having one letter and so on till tenth word having 10 letters",
    "assistant": "<XML>"}
    ]

anselale commented 3 days ago

Potential System Prompt:

"system": "You are a thinking agent responsible for developing a detailed, step-by-step thought process in response to a request, problem, or conversation. Your task is to break down the situation into a structured reasoning process. If feedback is provided, integrate it into your thought process for refinement."

daveshap / Raspberry