-
This issue will be used to track compilation failures for migraphx models on CPU and GPU. Each model's compile failure should link to an issue with a smaller reproducer in the notes column.
…
-
Hello Sai Kiran,
I came across your Medium [blog](https://medium.com/@kiranspixel/advanced-pii-detection-in-educational-data-using-bert-and-electra-5dc21571b610) on "Advanced PII Detection in E…
-
The current implementation of GPT-J and BERT carries out prediction in a sequential manner. Could the performance of GPT-J and BERT be improved by implementing parallel processing through threads ra…
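For illustration only, here is a minimal sketch (not from the reference code) of dispatching per-sample inference across a thread pool; `run_inference` and the sample list are hypothetical placeholders:

```python
from concurrent.futures import ThreadPoolExecutor
import time

def run_inference(sample):
    # Hypothetical stand-in for one forward pass. Real speedups depend
    # on the backend releasing the GIL during its native kernels
    # (as onnxruntime and PyTorch generally do).
    time.sleep(0.01)
    return f"prediction-{sample}"

samples = list(range(32))  # placeholder for pre-tokenized inputs

# Sequential baseline
t0 = time.perf_counter()
sequential = [run_inference(s) for s in samples]
t_seq = time.perf_counter() - t0

# Threaded variant
t0 = time.perf_counter()
with ThreadPoolExecutor(max_workers=8) as pool:
    threaded = list(pool.map(run_inference, samples))
t_par = time.perf_counter() - t0

print(f"sequential: {t_seq:.3f}s  threaded: {t_par:.3f}s")
```

Note that for pure-Python compute the GIL would serialize the threads; for models like BERT and GPT-J, batching requests into one larger forward pass is often the more effective optimization.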
-
Hi,
Attached is my cm-repro file:
[cm-repro.zip](https://github.com/user-attachments/files/16969739/cm-repro.zip)
I'm trying to run the MLPerf Reference Implementation for bert-large at h…
-
When I run the command
cm run script --tags=generate-run-cmds,inference,_find-performance,_all-scenarios --model=bert-99 --implementation=reference --device=cuda --backend=onnxruntime --category=edg…
-
### 🐛 Describe the bug
We are planning to upgrade our Python environment from 3.8 to 3.10, because PyTorch recently deprecated Python 3.8.
However, we found that there are performance gaps between pyt…
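A minimal sketch (placeholder model and shapes, not the reporter's actual workload) of a micro-benchmark that could be run under both interpreters to quantify the gap:

```python
import sys
import time
import torch

print(sys.version)  # record which interpreter produced the numbers

model = torch.nn.Linear(1024, 1024).eval()  # placeholder model
x = torch.randn(64, 1024)

with torch.inference_mode():
    for _ in range(10):   # warm-up
        model(x)
    t0 = time.perf_counter()
    for _ in range(100):
        model(x)
    elapsed = time.perf_counter() - t0

print(f"avg forward latency: {elapsed / 100 * 1e3:.3f} ms")
```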
-
I successfully installed CM following the guide at https://docs.mlcommons.org/ck/install/
and then referred to https://docs.mlcommons.org/inference/benchmarks/language/bert/ to run the scripts as belo…
-
(python3-venv) aarch64_sh ~> cm run script --tags=run-mlperf,inference,_find-performance,_full,_r4.1 --model=dlrm_v2-99 --implementation=reference --framework=pytorch --category=datacenter…
-
### Describe the issue
FP16 model inference is slower than FP32. Does FP16 inference require additional configuration, or is it enough to just convert the model to FP16?
### To reproduce
convert onnx …
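The reproduction steps are truncated above; for reference, a minimal sketch of one common conversion path, assuming the `onnxconverter-common` package (file paths are placeholders):

```python
import onnx
from onnxconverter_common import float16

model = onnx.load("model_fp32.onnx")  # placeholder path
model_fp16 = float16.convert_float_to_float16(
    model,
    keep_io_types=True,  # keep FP32 graph inputs/outputs
)
onnx.save(model_fp16, "model_fp16.onnx")
```

In general, FP16 only pays off on execution providers with native half-precision support (e.g. the CUDA EP on Tensor Core GPUs); on the default CPU EP, FP16 tensors are typically cast back to FP32, which can make inference slower than the FP32 model.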
-
Related to BERT/PyTorch
Describe the bug:
I want to reproduce the INT8 inference performance on a T4 or A2 GPU, but I don't know how to reproduce it and compare it with the inference performance NV…
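NVIDIA's submitted INT8 numbers come from TensorRT engines, so the sketch below is not their path; it is only an illustrative CPU baseline using PyTorch dynamic quantization (model name is a placeholder):

```python
import torch
from transformers import BertModel  # assumes transformers is installed

model = BertModel.from_pretrained("bert-base-uncased").eval()

# Quantize Linear layers to INT8 with dynamic activation scaling (CPU-only)
quantized = torch.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)

input_ids = torch.randint(0, 30522, (1, 128))  # dummy token IDs
with torch.inference_mode():
    out = quantized(input_ids)
print(out.last_hidden_state.shape)
```

Reproducing the T4/A2 figures themselves would require building the TensorRT INT8 engines from NVIDIA's MLPerf submission code.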