-
### Expected behavior
```python
with qml.tape.QuantumTape() as temp_go:
    qml.CNOT([1, 2])
    qml.CNOT([0, 1])
    qml.CNOT([2, 3])
    qml.CNOT([1, 2])
    qml.CNOT([2, 3])
    qm…
-
The current implementation of `scatter` has some limitations.
1. The GPU implementation hard-codes iterator bindings, which might not work for certain devices. For example, for the OpenCL backend, if a GP…
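To illustrate point 1, here is a rough TE-style sketch of binding a loop with a thread extent queried from the device instead of a hard-coded constant. The trivial copy compute stands in for the real scatter body, and the schedule is only an assumption for illustration, not the actual `scatter` implementation.
```python
import tvm
from tvm import te

# Query the device-reported limit instead of hard-coding the thread extent
# (requires an actual OpenCL device to be present).
dev = tvm.device("opencl", 0)
max_threads = dev.max_threads_per_block

n = te.var("n")
data = te.placeholder((n,), name="data", dtype="float32")
# A trivial copy stands in for the real scatter body.
out = te.compute((n,), lambda i: data[i], name="out")

s = te.create_schedule(out.op)
# Split by the queried limit, then bind block/thread axes.
bx, tx = s[out].split(out.op.axis[0], factor=max_threads)
s[out].bind(bx, te.thread_axis("blockIdx.x"))
s[out].bind(tx, te.thread_axis("threadIdx.x"))
```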
-
I learned from the Adreno GPU optimization manual (https://developer.qualcomm.com/download/adrenosdk/adreno-opencl-programming-guide.pdf?referrer=node/6114):
Avoid using the function called async_wo…
-
I used `nn.GPU` to allocate the model on different GPUs. Things were going well until I flattened the parameters and gradParameters for optimization.
Here is my code:
```
local model = nn.Sequenti…
-
I tried to convert the 7B quantized model but failed: AWQ reports that only GPU is supported, and GPTQ complains that `.float()` is not supported for quantized models. Have you converted a quantized model before, or is it simply not possible to convert quantized models? I saw a `loss.float()` in the tests, but I still have no idea where to start. Thanks!
-
### Describe the issue
I have a model that is 4137 MB as a .onnx file, exported from a PyTorch `ScriptModule` through `torch.onnx.export`.
When loading the ONNX model through an InferenceSession us…
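For context, a minimal sketch of the export/load path described above is shown below; the tiny model, input shape, and file name are placeholders rather than the actual model from this issue (which, at ~4 GB, would also run into ONNX's external-data handling for protobufs over 2 GB).
```python
import torch
import torch.nn as nn
import onnxruntime as ort

class TinyModel(nn.Module):  # stand-in for the real model
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(16, 4)

    def forward(self, x):
        return self.fc(x)

# Export a ScriptModule to ONNX (recent PyTorch versions accept scripted
# modules directly in torch.onnx.export).
scripted = torch.jit.script(TinyModel().eval())
dummy = torch.randn(1, 16)
torch.onnx.export(scripted, (dummy,), "model.onnx")

# Load it back through an ONNX Runtime InferenceSession and run once.
sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
out = sess.run(None, {sess.get_inputs()[0].name: dummy.numpy()})
```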
-
Below is a typical usage pattern for changing the optimization level of some Fortran files:
```
set(NOOPT
  eam/src/physics/cam/zm_conv.F90)
if (NOT DEBUG)
  foreach(ITEM IN LISTS NOOPT)
    e3sm_deoptim…
-
### Milestones:
- Study 3DGS implementations and identify best option, risks and blockers for integration
- Study [filament renderer](https://google.github.io/filament/Filament.html) and Open3D-fi…
-
If I run this code:
```python
import torch
from transformers import Qwen2VLForConditionalGeneration

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-7B-Instruct",
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",
…
-
### 🚀 The feature, motivation and pitch
# Motivation
For complicated `DTensor` redistribution (e.g. `[S(0), S(1)] -> [S(1), S(0)]`), it's likely that only GPU1 and GPU2 need to communicate (when t…
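For reference, a minimal sketch of the redistribution pattern mentioned above, assuming a recent PyTorch where `torch.distributed.tensor` is public; the 2x2 mesh, tensor size, and launch setup are illustrative assumptions, not taken from this issue.
```python
# Run under torchrun with 4 ranks, e.g. `torchrun --nproc_per_node=4 demo.py`.
import torch
import torch.distributed as dist
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor import distribute_tensor, Shard

dist.init_process_group("nccl")
mesh = init_device_mesh("cuda", (2, 2))                # 2x2 mesh over 4 GPUs

x = torch.randn(8, 8)
dt = distribute_tensor(x, mesh, [Shard(0), Shard(1)])  # [S(0), S(1)]
dt2 = dt.redistribute(mesh, [Shard(1), Shard(0)])      # -> [S(1), S(0)]
print(dt2.placements)
```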