PaddlePaddle / PaddleNLP

πŸ‘‘ Easy-to-use and powerful NLP and LLM library with πŸ€— Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including πŸ—‚Text Classification, πŸ” Neural Search, ❓ Question Answering, ℹ️ Information Extraction, πŸ“„ Document Intelligence, πŸ’Œ Sentiment Analysis etc.
https://paddlenlp.readthedocs.io
Apache License 2.0
11.71k stars 2.86k forks source link

[LLM] support sparse attention for LLAMA #8592

Closed GuoxiaWang closed 1 week ago

GuoxiaWang commented 2 weeks ago

PR types

New features

PR changes

Models

Description

[LLM] support sparse attention for LLAMA

paddle-bot[bot] commented 2 weeks ago

Thanks for your contribution!

codecov[bot] commented 2 weeks ago

Codecov Report

Attention: Patch coverage is 14.89362% with 40 lines in your changes missing coverage. Please review.

Project coverage is 54.17%. Comparing base (a8d446c) to head (61f2314). Report is 1 commits behind head on develop.

Files Patch % Lines
paddlenlp/transformers/llama/modeling_pp.py 10.25% 35 Missing :warning:
paddlenlp/transformers/llama/fusion_ops.py 0.00% 4 Missing :warning:
paddlenlp/transformers/llama/modeling.py 75.00% 1 Missing :warning:
Additional details and impacted files ```diff @@ Coverage Diff @@ ## develop #8592 +/- ## =========================================== - Coverage 54.18% 54.17% -0.02% =========================================== Files 625 625 Lines 98942 98972 +30 =========================================== + Hits 53612 53615 +3 - Misses 45330 45357 +27 ```

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.