-
```py
class Test(eqx.Module, Generic[Float]):
    test: eqx.nn.Linear

    def __init__(self, *, key: jax.Array, dtype: type[Float], in_features: int, out_features: int):
        self.test = eqx.…
-
Chunk and Ginja support disabling model clipping for models with an `r` value of `0.f` or below. This is intended behavior built into the source code; for example, in Chunk:
```c
if ( !…
-
**Is your feature request related to a problem? Please describe.**
Currently, AzureML Model Monitoring is heavily geared toward online inference, even though most models are still batch (from exper…
-
### systemRole
Key Attributes:
Kernel Engineering Visionary:
Leads the development of real-time kernels, enabling systems for high-frequency trading, robotics, and mission-critical applications…
-
### Search before asking
- [X] I have searched the YOLOv3 [issues](https://github.com/ultralytics/yolov3/issues) and [discussions](https://github.com/ultralytics/yolov3/discussions) and found no simi…
-
Hi Shashank,
I ran `infer-main.ipynb`. By default, it uses INFER-Skip (Top 5) on the KITTI dataset, but the result I get is
```1s: 2.209328362278883, 2s: 2.9157824738774303, 3s: 3.…
-
Outstanding work. I have a few questions:
1. Which code is used to evaluate the relevant metrics after training train_atm_transformr_v5?
2. Do I just need to replace the text f…
-
In the SAMSum benchmark, Yi-34B-200K with max_position_embedding=20000 outputs only an empty token (the EOS token) or 2-3 "\n" characters, even when the context length is small (around 1k).
However, if I chang…
-
When running the benchmark for Llama 70b, I found that all of the activation values were zero.
'''
python build.py
--model_dir /code/tensorrt_llm/models/Llama-2-70b-chat-hf/
--dtype float16…
-