issues
search
hpcaitech
/
EnergonAI
Large-scale model inference.
Apache License 2.0
630
stars
90
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Added parallel code for chatglm-6B
#225
Caesar1993
opened
1 year ago
0
关于示例代码版本落后,无法运行等问题About the example code version backward, can not run and other issues
#224
AntyRia
opened
1 year ago
7
[bug] The InferenceEngine used in the example for distributed inference cannot be imported.
#223
flybird11111
opened
1 year ago
0
OPT-125m problem
#222
Yummy813
opened
1 year ago
0
Location of logs
#221
Yummy813
closed
1 year ago
0
Docker cannot find the parent image defined in `docker/Dockerfile`
#220
Aavache
opened
1 year ago
0
Does EnergonAI support accelerated inference for segmenting anything?
#219
sanbuphy
opened
1 year ago
0
Support GPT BigCode (bigcode/starcoder, bigcode/gpt_bigcode-santacoder, etc.)
#218
liulhdarks
opened
1 year ago
1
Failed to load pre-trained model weights for OPT_125M
#217
zhengmk321
closed
1 year ago
2
Where is InferenceEngine definition?
#216
yehx1
opened
1 year ago
0
EnergonAI running OPT reasoning example: When encountering a client request, the server is blocked and cannot return the result
#215
colynhn
opened
1 year ago
1
Is there an example of the http client?
#214
frankxyy
opened
1 year ago
1
question about load model state_dict in multi-gpus
#213
irasin
closed
1 year ago
2
Concrete doc of this project
#212
frankxyy
opened
1 year ago
1
_pickle.UnpicklingError: invalid load key, '{'.
#211
RundongCao
opened
1 year ago
4
Where is InferenceEngine definition?
#210
liujuncn
opened
1 year ago
2
Is there any examples of using offload feature in GPT/BLOOM/OPT inference?
#209
YJHMITWEB
opened
1 year ago
1
[hotfix] fix rpc init and opt checkpoint key mapping
#208
ver217
closed
1 year ago
0
polish code and add setup script
#207
juncongmoo
opened
1 year ago
2
an error caused by running the example of the opt
#206
LemonSqi
opened
1 year ago
4
failure to compile energonai by the command : python setup.py build
#205
LemonSqi
opened
1 year ago
1
fail to install EnergonAI
#204
NewDriverLee
opened
1 year ago
3
OPT demo TEST
#203
Batizhao8899
opened
1 year ago
2
Add a sync version of wait()
#202
juncongmoo
closed
1 year ago
1
miss cache error when pose generation opt
#201
tycallen
opened
1 year ago
2
[misc] add license
#200
ver217
closed
1 year ago
0
How to use dynamic batch features
#199
hudengjunai
opened
1 year ago
1
OPT inference
#198
Joanna-0421
opened
1 year ago
2
Fix bloom-int8 for v100
#197
Oliver-ss
closed
1 year ago
1
Doesn't run gpt reference?
#196
YuchengWang
opened
1 year ago
1
[kernal] remove redundant cuda kernal
#195
binmakeswell
closed
1 year ago
1
Maybe you should add license for using OneFlow's LayerNorm Kernel implement?
#194
MARD1NO
closed
1 year ago
2
Does not support Cuda 10.2 ?
#193
0-1CxH
opened
1 year ago
1
Not compatible with the latest version of transformers? (4.26.1)
#192
skiingpacman
opened
1 year ago
2
Can not start the Bloom server
#191
SAI990323
opened
1 year ago
3
fix typo in p2p.py
#190
eltociear
closed
1 year ago
1
How to inference BLOOM-176B by multi-node multi-card?
#189
vicwer
opened
1 year ago
2
[doc] fix typo of BLOOM
#188
binmakeswell
closed
1 year ago
0
Why does it unreadable generated by OPT-30B inferring with EnergonAI
#187
ericxsun
closed
1 year ago
11
Cannot run opt 125m examples with latest energonai docker images
#186
zhanghaoie
opened
1 year ago
2
[example] fix requirements
#185
binmakeswell
closed
1 year ago
0
Support OPT-IML model
#184
ericxsun
closed
1 year ago
1
Failed to load OPT-30B checkpoint
#183
ericxsun
closed
1 year ago
2
Detected RRef Leaks during shutdown, empty pipe, tests_engine failed
#182
nostalgicimp
opened
1 year ago
1
Update bloom/README with Generate time tesing
#181
ht-zhou
closed
1 year ago
0
Update README.md
#180
ht-zhou
closed
1 year ago
0
Update README
#179
ht-zhou
closed
1 year ago
0
polish
#178
ht-zhou
closed
1 year ago
0
[bloom] remove config.json
#177
feifeibear
closed
1 year ago
0
Add Bloom-int8-tp support
#176
ht-zhou
closed
1 year ago
0
Next