issues
search
evalplus
/
repoqa
RepoQA: Evaluating Long-Context Code Understanding
https://evalplus.github.io/repoqa.html
Apache License 2.0
96
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Question about the evaluation metric.
#51
shubhamrgandhi
opened
1 day ago
0
feat: add scripts for question generation, LLM-based question/issue a…
#50
claudeyj
opened
1 week ago
0
Problem with tree_sitter_languages
#49
gaotianyu1350
closed
1 month ago
1
[DEV] Document result maintenance in `result/README.md`
#48
ganler
closed
2 months ago
1
Adding embeddings results
#47
vdaita
opened
3 months ago
0
feat: no padding option
#46
JialeTomTian
closed
2 months ago
1
feat: position & content consistent comment removal
#45
JialeTomTian
closed
3 months ago
4
`wget` missing in the requirement?
#44
sfc-gh-ywei
closed
3 months ago
3
Comment Free Dataset
#43
JialeTomTian
closed
3 months ago
0
[Bug] Huggingface backend seems to be broken
#42
ganler
closed
4 months ago
3
Discrepancies in Model Evaluation Results for Mistral-7B-Instruct-v0.2 and phi-3-mini-128-instruct
#41
zyzzzz-123
closed
4 months ago
19
This test is heavily dependent on the code capability of the model ?
#40
hijkzzz
closed
4 months ago
3
feat: comment free compute
#39
JialeTomTian
closed
4 months ago
2
minor: update context fetching
#38
JialeTomTian
closed
4 months ago
0
Use dynamic rope scaling by default for models with <=16k context length
#37
ganler
closed
4 months ago
10
claude-3-opus-20240229 results
#36
scf4
closed
4 months ago
6
feat: base model prompt template
#35
UniverseFly
closed
1 week ago
0
feat(dependency):Go
#34
Katherine0105
closed
4 months ago
2
feat(compute): fetch train size
#33
JialeTomTian
closed
4 months ago
2
feat(cherry-pick): Go
#32
Katherine0105
closed
4 months ago
0
feat(cherry-pick): Go
#31
Katherine0105
closed
4 months ago
1
RepoQA v0.1.0 Release
#30
ganler
closed
4 months ago
3
feat(dep): typescript
#29
JialeTomTian
closed
5 months ago
0
Leaderboard construction
#28
ganler
closed
4 months ago
2
feat: model output evaluation framework
#27
JialeTomTian
closed
4 months ago
3
Accelerate evaluation using a cache mechanism
#26
ganler
closed
5 months ago
1
[Tracking] Evaluating models on base dataset using 16k context
#25
ganler
closed
4 months ago
7
Result evaluator of code needle search
#24
ganler
closed
4 months ago
0
[Design] Searching Needle Functions
#23
ganler
closed
4 months ago
0
feat: removing natural language function names and comments
#22
vdaita
closed
4 months ago
4
feat: function similarity
#21
JialeTomTian
closed
5 months ago
0
added 10 Typescript repositories to lists.json
#20
vdaita
closed
5 months ago
2
feat: comment analysis
#19
JialeTomTian
closed
5 months ago
1
refactor(rust & java): dependency
#18
JialeTomTian
closed
5 months ago
4
Isolate dependency
#17
ganler
closed
5 months ago
0
feat(rust): dependency analysis
#16
JialeTomTian
closed
5 months ago
4
Function selection by analyzing code-comment ratio
#15
ganler
closed
5 months ago
0
feat(cherry-pick): C++
#14
claudeyj
closed
5 months ago
8
feat(cherry-pick): rust
#13
JialeTomTian
closed
5 months ago
2
Cherrypick + Depedency analysis for TypeScript
#12
ganler
closed
5 months ago
1
Cherrypick + Depedency analysis for Rust
#11
ganler
closed
5 months ago
3
Cherrypick + Depedency analysis for C++
#10
ganler
closed
5 months ago
5
Function extraction
#9
vdaita
closed
5 months ago
1
Function picking
#8
ganler
closed
5 months ago
1
Thoughts on QA design
#7
UniverseFly
opened
6 months ago
6
Add Cherry Pick Projects for Java
#6
JialeTomTian
closed
6 months ago
0
Dataset ensemble
#5
ganler
closed
6 months ago
1
Dependency analysis
#4
ganler
closed
5 months ago
1
Format of RepoQA
#3
UniverseFly
closed
4 months ago
3
feat: add some fine-grained repo filtering
#2
soryxie
closed
6 months ago
0
Next