-
Hello, thank you for supplying the code for the paper.
I consider this paper the current state of the art in LLM-based vulnerability detection because recent ICSE papers depend on sLLMs such as codeB…
-
Hi, I am trying to use your code. However, I noticed this repo is not a complete version, as the training data is missing.
Is there more detailed documentation on how to use your code? And if you can…
hxue3 updated
11 months ago
-
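Hello, I would like to use UniXcoder for the code-refine task, a downstream task similar to neural machine translation.
I based my implementation on UniXcoder's Code-generation and Code-summarization downstream tasks, but something may be wrong with it: the loss on the training set converges normally, yet the xmatch and BLEU scores are poor (worse than CodeBERT and GraphCodeBERT), and sometimes even…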
-
Hi,
Following this link https://github.com/microsoft/CodeBERT/tree/master/GraphCodeBERT/codesearch ,
I unzipped the dataset and tried to execute run.sh, but I am getting a 403 status code on the http requ…
-
Hi @urialon, is there a way to run Code2Vec on JavaScript benchmarks? Can you guide me on how to get the Code2Vec input format for JavaScript benchmarks?
Currently, I am using the ASTminer library but the r…
-
Why does the script finetune-msg.sh finish immediately without producing any checkpoint steps? Did I modify something incorrectly?
```bash
cat finetune-msg.sh
# batch size 6 for 16 GB GPU
mnt_dir="/home/codereview"
# You may change the following block for…
```
-
Hi,
Line 82 of the `run_finetune_msg` file reads: `with open(os.path.join(args.model_name_or_path, "preds.txt"), "w", encoding="utf-8") as f:`. I cannot find a preds.txt file on Hugging Face, and golds.txt is missing as well. How can I resolve this?
The error message is: `FileNotFo…
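
One possible cause, stated here as an assumption rather than a confirmed diagnosis: if `args.model_name_or_path` holds a Hugging Face hub ID instead of a local directory, `os.path.join(args.model_name_or_path, "preds.txt")` points into a directory that does not exist locally, and opening it in `"w"` mode raises `FileNotFoundError`. In that reading, preds.txt and golds.txt are files the script writes, not files to download. The helper name `open_for_predictions` and the placeholder ID `some-org/some-model` below are hypothetical and are not part of the repository.

```python
import os
import tempfile


def open_for_predictions(base_dir, filename="preds.txt"):
    """Create base_dir if it is missing, then open filename inside it for writing."""
    os.makedirs(base_dir, exist_ok=True)
    return open(os.path.join(base_dir, filename), "w", encoding="utf-8")


# Usage with a throwaway directory. "some-org/some-model" is a placeholder
# that stands in for a hub-style ID used as a relative path.
with tempfile.TemporaryDirectory() as tmp:
    out_dir = os.path.join(tmp, "some-org", "some-model")
    with open_for_predictions(out_dir) as f:
        f.write("example prediction\n")
```

Pointing the model path at a local checkpoint directory, or writing predictions to a separate output directory, would be two ways to apply the same idea.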
-
As we extend deduplication to a wide range of languages, the choice of tokenization method will affect the final results.
The current script uses a simple regex and uni-gram to perform min…
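
A minimal sketch of why the tokenizer matters, under stated assumptions: the pattern `\W+`, the function names, and the use of exact Jaccard similarity are all illustrative choices and are not taken from the script under discussion. It shows that a regex split on non-word characters yields useful unigrams for space-delimited text but collapses an unsegmented sentence into a single token, where character n-grams still produce several comparable units.

```python
import re

# Hypothetical pattern: split on any run of non-word characters.
NON_WORD = re.compile(r"\W+")


def regex_unigrams(text):
    """Set of lowercase unigram tokens obtained by a simple regex split."""
    return {tok for tok in NON_WORD.split(text.lower()) if tok}


def char_ngrams(text, n=3):
    """Set of character n-grams; short inputs yield the whole text as one gram."""
    return {text[i:i + n] for i in range(max(len(text) - n + 1, 1))}


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 1.0 when both are empty."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


code_a = "def add(a, b): return a + b"
code_b = "def add(x, y): return x + y"
print(jaccard(regex_unigrams(code_a), regex_unigrams(code_b)))

# Text without spaces stays a single token under the regex split,
# while character bigrams still break it into overlapping pieces.
print(regex_unigrams("你好世界"))
print(char_ngrams("你好世界", n=2))
```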
-
It is mentioned on the website (https://codist-ai.com/) that this model gives 90% accuracy.
Can you elaborate on what exactly this accuracy is and how it is measured?
-
https://github.com/microsoft/CodeXGLUE/blob/6744a7f6ab658a15382f842df6b9c5f148423a49/Code-Code/code-to-code-trans/evaluator/CodeBLEU/dataflow_match.py#L100
The result of `list(set())` is random und…
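
A minimal sketch of the ordering concern, assuming the goal is a reproducible order for de-duplicated items: `list(set(...))` gives no ordering guarantee, and for strings the order can differ between interpreter runs because string hashing is randomized per process unless `PYTHONHASHSEED` is fixed. The helper names below are hypothetical and are not taken from the linked file.

```python
def dedupe_sorted(items):
    """De-duplicate and return the items in sorted order (deterministic)."""
    return sorted(set(items))


def dedupe_keep_order(items):
    """De-duplicate while preserving first-seen order (deterministic)."""
    # dict keys keep insertion order in Python 3.7 and later.
    return list(dict.fromkeys(items))


names = ["b", "a", "c", "a", "b"]
print(dedupe_sorted(names))      # ['a', 'b', 'c']
print(dedupe_keep_order(names))  # ['b', 'a', 'c']
```

Either variant makes downstream results independent of the hash seed; which one fits depends on whether the original order carries meaning.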