evalplus repoqa issues - Githubissues

evalplus / repoqa

RepoQA: Evaluating Long-Context Code Understanding

https://evalplus.github.io/repoqa.html

Apache License 2.0

96 stars, 3 forks

issues


Question about the evaluation metric.

#51 shubhamrgandhi opened 1 day ago
0
feat: add scripts for question generation, LLM-based question/issue a…

#50 claudeyj opened 1 week ago
0
Problem with tree_sitter_languages

#49 gaotianyu1350 closed 1 month ago
1
[DEV] Document result maintenance in `result/README.md`

#48 ganler closed 2 months ago
1
Adding embeddings results

#47 vdaita opened 3 months ago
0
feat: no padding option

#46 JialeTomTian closed 2 months ago
1
feat: position & content consistent comment removal

#45 JialeTomTian closed 3 months ago
4
`wget` missing in the requirement?

#44 sfc-gh-ywei closed 3 months ago
3
Comment Free Dataset

#43 JialeTomTian closed 3 months ago
0
[Bug] Huggingface backend seems to be broken

#42 ganler closed 4 months ago
3
Discrepancies in Model Evaluation Results for Mistral-7B-Instruct-v0.2 and phi-3-mini-128-instruct

#41 zyzzzz-123 closed 4 months ago
19
This test is heavily dependent on the code capability of the model?

#40 hijkzzz closed 4 months ago
3
feat: comment free compute

#39 JialeTomTian closed 4 months ago
2
minor: update context fetching

#38 JialeTomTian closed 4 months ago
0
Use dynamic rope scaling by default for models with <=16k context length

#37 ganler closed 4 months ago
10
claude-3-opus-20240229 results

#36 scf4 closed 4 months ago
6
feat: base model prompt template

#35 UniverseFly closed 1 week ago
0
feat(dependency):Go

#34 Katherine0105 closed 4 months ago
2
feat(compute): fetch train size

#33 JialeTomTian closed 4 months ago
2
feat(cherry-pick): Go

#32 Katherine0105 closed 4 months ago
0
feat(cherry-pick): Go

#31 Katherine0105 closed 4 months ago
1
RepoQA v0.1.0 Release

#30 ganler closed 4 months ago
3
feat(dep): typescript

#29 JialeTomTian closed 5 months ago
0
Leaderboard construction

#28 ganler closed 4 months ago
2
feat: model output evaluation framework

#27 JialeTomTian closed 4 months ago
3
Accelerate evaluation using a cache mechanism

#26 ganler closed 5 months ago
1
[Tracking] Evaluating models on base dataset using 16k context

#25 ganler closed 4 months ago
7
Result evaluator of code needle search

#24 ganler closed 4 months ago
0
[Design] Searching Needle Functions

#23 ganler closed 4 months ago
0
feat: removing natural language function names and comments

#22 vdaita closed 4 months ago
4
feat: function similarity

#21 JialeTomTian closed 5 months ago
0
added 10 Typescript repositories to lists.json

#20 vdaita closed 5 months ago
2
feat: comment analysis

#19 JialeTomTian closed 5 months ago
1
refactor(rust & java): dependency

#18 JialeTomTian closed 5 months ago
4
Isolate dependency

#17 ganler closed 5 months ago
0
feat(rust): dependency analysis

#16 JialeTomTian closed 5 months ago
4
Function selection by analyzing code-comment ratio

#15 ganler closed 5 months ago
0
feat(cherry-pick): C++

#14 claudeyj closed 5 months ago
8
feat(cherry-pick): rust

#13 JialeTomTian closed 5 months ago
2
Cherrypick + Dependency analysis for TypeScript

#12 ganler closed 5 months ago
1
Cherrypick + Dependency analysis for Rust

#11 ganler closed 5 months ago
3
Cherrypick + Dependency analysis for C++

#10 ganler closed 5 months ago
5
Function extraction

#9 vdaita closed 5 months ago
1
Function picking

#8 ganler closed 5 months ago
1
Thoughts on QA design

#7 UniverseFly opened 6 months ago
6
Add Cherry Pick Projects for Java

#6 JialeTomTian closed 6 months ago
0
Dataset ensemble

#5 ganler closed 6 months ago
1
Dependency analysis

#4 ganler closed 5 months ago
1
Format of RepoQA

#3 UniverseFly closed 4 months ago
3
feat: add some fine-grained repo filtering

#2 soryxie closed 6 months ago
0