Closed WoosukKwon closed 1 week ago
Currently, the log is printed whenever the paged attention op is compiled (for every layer), which is not needed for end users.
@vanbasten23 Let me know if there's a better way to log this.
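One possible approach, sketched here under assumptions rather than taken from the actual change: emit the user-facing message once per process and send the per-layer detail to DEBUG. The logger name, `log_once`, and `compile_paged_attention` are hypothetical names used only for illustration; they are not part of vLLM or the paged attention op.

```python
import functools
import logging

# Hypothetical logger name, chosen only for this sketch.
logger = logging.getLogger("paged_attention_demo")


@functools.lru_cache(maxsize=None)
def log_once(message: str) -> None:
    """Emit `message` at INFO at most once per process.

    lru_cache turns repeated calls with the same message into no-ops.
    """
    logger.info(message)


def compile_paged_attention(layer_idx: int) -> None:
    """Hypothetical stand-in for the per-layer compilation of the op."""
    # Shown once in total, no matter how many layers are compiled.
    log_once("Compiling paged attention op")
    # Per-layer detail stays available, but only at DEBUG level.
    logger.debug("Compiled paged attention for layer %d", layer_idx)
```

With this shape, end users running at the default log level see a single line, while developers can still get one line per layer by enabling DEBUG on the logger.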
Hey @WoosukKwon, could you point me to the PR in vLLM that integrates the new paged attention?