-
Thanks for the great work. And I have a question about the GPT scores for video comprehension.
I evaluated the GPT score for video comprehension using the evaluation code you published and the hugg…
-
Post your response to our challenge questions.
First, write down three intuitions you have about broad content patterns you will discover in your data. Plan an asterisk next to the one you expect m…
lkcao updated
6 months ago
-
It would be interesting to see if/how `aider` performs against the SWE-Bench benchmarks:
- https://www.swebench.com/
- https://github.com/princeton-nlp/SWE-bench
- > [ICLR 2024] SWE-Bench: Can …
-
It would be great if we could group the tasks by whether they require money or not.
In short, we need to split tasks by their need for OpenAI API configurations.
-
![image](https://github.com/wbbeyourself/MAC-SQL/assets/80022154/50318e98-221d-467e-bab3-3eb5791113b7)
In question #7 , I see that the results of spider in the paper are obtained by GPT-4-32K, an…
-
Right now the user needs to explicitly return in the traced function a dict that contains the cost, message, and number of tokens.
However, this information is simply the sum of costs and tokens used…
-
**Is your feature request related to a problem? Please describe.**
gpt-4 is very costly and gpt-3.5 provides low grade output. I'd like to use gpt-4-turbo for evaluation
**Describe the solution yo…
-
## Keyword: efficient
### End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
- **Authors:** Javier Campos, Zhen Dong, Javier Duarte, Amir Gholami, Michael W. Mahoney,…
-
-
Tasks
- [x] Create showcase application which demonstrates all functionality below
To make filters non-experimental, the following user stories should be met:
- [x] **Telemetry** – any of the telem…