bin123apple / Fortran2Cpp

Fortran2Cpp: a new model designed for code translation between Fortran and C++
Apache License 2.0

Fortran2Cpp dataset questions #24

Open chunhualiao opened 1 month ago

chunhualiao commented 1 month ago

The dataset is a set of dialogues with id values from 1 to 2529.

One example dialogue:

Round 1: Translate the Fortran code into C++.

Round 2: Provide unit tests for both the Fortran code and the translated C++ code. (This prompt may be too complex; would it be better to split it into two?)

Round 3: What does this step do? You may need to explain to the LLM what the error message is about and how it was generated.

Round 4: Why do you need to ask the LLM to check the unit test results? The unit test results already provide the answer directly.

Round 5: Create the Fortran-C++ code pair.

I guess you store the final pairs extracted from the dialogues in a dataset somewhere.

Where can I find them?

bin123apple commented 1 month ago

I just uploaded the dataset; it is in the data/F2C_dialogue_25K.json file.

chunhualiao commented 4 days ago

Please explain the format of the dataset we will release to the community, and write this up in the paper.

Please justify your choice.

How do you expect users to extract traditional code pairs from your dialogues?

bin123apple commented 4 days ago

It is a sequence of dialogues.
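
For illustration only, here is a minimal sketch of how a user might pull Fortran-C++ pairs out of such dialogues. The schema is an assumption, not something confirmed in this thread: each dialogue is taken to be an object with an `id` and a `messages` list whose items have a `content` string, and code is taken to appear in fenced blocks tagged with a language name. The function names and tag sets are hypothetical. The sketch keeps the last Fortran block and the last C++ block seen in each dialogue, on the guess that the final round holds the accepted pair.

```python
import re

# Build the fence marker indirectly so this example can itself sit in a fenced block.
FENCE_MARK = "`" * 3

# Matches one fenced code block; captures the language tag and the body.
FENCE = re.compile(FENCE_MARK + r"([\w+]*)[ \t]*\n(.*?)" + FENCE_MARK, re.DOTALL)

# Hypothetical tag sets; adjust to whatever tags the real file uses.
FORTRAN_TAGS = {"fortran", "fortran90", "f90", "f95"}
CPP_TAGS = {"cpp", "c++", "cxx"}


def extract_pair(messages):
    """Return (fortran_code, cpp_code) from one dialogue, or None if either is missing.

    Later blocks overwrite earlier ones, so the last block of each language wins.
    """
    fortran = None
    cpp = None
    for message in messages:
        for tag, body in FENCE.findall(message["content"]):
            tag = tag.lower()
            if tag in FORTRAN_TAGS:
                fortran = body.strip()
            elif tag in CPP_TAGS:
                cpp = body.strip()
    if fortran is None or cpp is None:
        return None
    return fortran, cpp


def extract_pairs(dialogues):
    """Map each dialogue id to its (fortran_code, cpp_code) pair, skipping incomplete ones."""
    pairs = {}
    for dialogue in dialogues:
        pair = extract_pair(dialogue["messages"])
        if pair is not None:
            pairs[dialogue["id"]] = pair
    return pairs
```

If the real file has this shape, loading data/F2C_dialogue_25K.json with `json.load` and passing the result to `extract_pairs` would yield the pairs. If the field names or code delimiters differ, only `extract_pair` needs to change.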

chunhualiao commented 15 hours ago

The dataset should eventually be released on Hugging Face with sufficient documentation.