Describe the bug
The `ibm/granite-instruct` preset for the IBMvLLM adapter is not specific to the model being used. I assume it is `ibm-granite/granite-3.0-8b-instruct`, and the preset should be named as such to avoid confusion.
Assuming the model is `ibm-granite/granite-3.0-8b-instruct`, the backend returns an incorrect `max_sequence_length` of 8192. The correct `max_sequence_length` is 4096.
To Reproduce
Steps to reproduce the behavior:
Create a GraniteBeeAgent with the IBMvLLM backend using the preset `ibm/granite-instruct`
Set a breakpoint in the `meta` method of the corresponding `llm.ts`
The result will indicate a `max_sequence_length` of 8192
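For illustration, here is a minimal sketch of the kind of preset-to-metadata mapping this report is asking to correct. The names `PresetMeta`, `presets`, and the renamed preset key are hypothetical and for illustration only, not the adapter's actual source; the values reflect the model id and the 4096 context length stated above.

```typescript
// Hypothetical preset table for the IBMvLLM adapter (illustrative names,
// not the real implementation in llm.ts).
interface PresetMeta {
  modelId: string;
  maxSequenceLength: number;
}

const presets: Record<string, PresetMeta> = {
  // Preset renamed to include the full model version, as this report
  // suggests, with the corrected context length (4096, not 8192).
  "ibm/granite-3.0-8b-instruct": {
    modelId: "ibm-granite/granite-3.0-8b-instruct",
    maxSequenceLength: 4096,
  },
};

console.log(presets["ibm/granite-3.0-8b-instruct"].maxSequenceLength); // 4096
```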
Expected behavior
`max_sequence_length` of 4096

Screenshots / Code snippets
Set-up: