architecture-optimization Search Results

1000+ results
for architecture-optimization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

STH-Dev/linux-bench #9

Potential GPGPU bench

The Parboil benchmarks are a set of throughput computing applications...Each benchmark includes several implementations. Some implementations we provide as readable base implementations from which new…

Stevensb updated 8 years ago
2
kokkos/kokkos #4953

Compiling for multiple CUDA architectures

NVCC can generate machine code for multiple compute capabilities (a fat binary), but Kokkos CMake list does not let me specify multiple CUDA architectures: ``` CMake Error at cmake/kokkos_arch.cma…

carterbox updated 1 year ago
4
janhq/cortex.cpp #470

hardware: Intel iGPU, dGPU and NPU support

## Overview - Intel's Lunar Lake is releasing soon, which has CPU, NPU and iGPU in a single chip ## Tasklist - [x] https://github.com/janhq/cortex.cpp/issues/677 - [x] https://github.com/janhq/cort…

xiangyang-95 updated 1 week ago
4
oxc-project/backlog #91

Pointer tagging

## What is pointer tagging? I think on most (or quite possibly all) Oxc's supported 64-bit architectures, the top 6 or 7 bits of pointers are unused, and could be used to pack additional data into …

overlookmotel updated 1 month ago
3
openvinotoolkit/openvino #25571

[Bug]: OpenVINO as backend to Pytorch - integrated GPU works…

### OpenVINO Version 2024.2.0 ### Operating System Other (Please specify in description) ### Device used for inference GPU ### Framework PyTorch ### Model used _No response_ ### Issue descri…

js333031 updated 2 months ago
1
renpy/renpy-build #139

ctypes.CFUNCTYPE raises MemoryError on Apple Silicon

```py import ctypes @ctypes.CFUNCTYPE( None, ) def test(): return None ``` Perhaps the `libffi` version should be raised higher. ``` 3.4.6 Feb-18-2024 Fix long double regr…

qTich updated 4 months ago
2
golang/go #21735

cmd/compile: avoid slow versions of LEA instructions on x86

On newer x86 cpus (amd and intel) 3 operand LEA instructions with base, index and offset have a higher latency and less throughput than 2 operand LEA instructions. The compiler when emitting the i…

martisch updated 1 year ago
21
InternLM/lmdeploy #2040

[Feature] Why not support vision model quantilization?

### Motivation Set internvl as an example, it's vision model is 6B. If the vision model can be quantilized, the inference process can be done in only one 4090. 请问目前vision model不支持量化的原因，是因为feature暂时还…

Leo-yang-1020 updated 4 months ago
1
rohitinu6/Stock-Price-Prediction #76

Enhancing Real-Time Data, Model Performance, and Scalability

### Is this a unique feature? - [X] I have checked "open" AND "closed" issues and this is not a duplicate ### Is your feature request related to a problem/unavailable functionality? Please descr…

Keerthi421 updated 1 month ago
3
vllm-project/vllm #9006

[Roadmap] vLLM Roadmap Q4 2024

This page is accessible via [roadmap.vllm.ai](https://roadmap.vllm.ai) ### Themes. As before, we categorized our roadmap into 6 broad themes: broad model support, wide hardware coverage, state of…

simon-mo updated 5 days ago
25

上一页 1...19 20 21 22 23 24 25...100 下一页

1000+ results for architecture-optimization

1000+ results
for architecture-optimization