basetenlabs / truss-examples

Examples of models deployable with Truss
https://trussml.com
MIT License
103 stars 24 forks source link

[Droid] DBRX Truss Implementation #283

Open factory-droid[bot] opened 2 months ago

factory-droid[bot] commented 2 months ago
Rotor

Summary of Changes

This PR implements the DBRX truss for the Baseten platform. It includes the model.py file with the Model class that loads the DBRX model and runs inference. The requirements.txt file lists the necessary dependencies, and the config.yaml file configures the truss for Python 3.11 and specifies the use of A10G GPUs. The README.md file documents the truss's functionality and usage. This implementation follows the instructions from the DBRX model's original repository and is designed for efficient inference on the Baseten platform.

Fixes BT-10619

Tips

linear[bot] commented 2 months ago

BT-10619 Create DBRX truss

factory-droid[bot] commented 2 months ago

[!IMPORTANT]

Solutions for GitHub Action Failures - b2e77b2864b00ad265e8b5ebb7a4b16289a13ece - 1

Droid deploy truss

The error is due to an incorrect or missing API key for the Baseten API. To resolve this issue, please ensure that the correct API key is set in the environment variable BASETEN_API_KEY.

If you are running this script in a GitHub Actions workflow, you can set the environment variable in the workflow file as shown below:

steps:
  - name: Set up Python
    uses: actions/setup-python@v2
    with:
      python-version: '3.x'
  - name: Run script
    run: python bin/test_truss_deploy.py
    env:
      BASETEN_API_KEY: ${{ secrets.BASETEN_API_KEY }}

In the above snippet, replace BASETEN_API_KEY with your actual API key. If your API key is sensitive, it is recommended to store it as a secret in your GitHub repository settings and then reference it in the workflow file as shown above.

After updating the API key, re-run the GitHub Actions workflow. The script should now be able to authenticate with the Baseten API and the error should be resolved.

factory-droid[bot] commented 2 months ago

[!IMPORTANT]

Solutions for GitHub Action Failures - 42c30826c687048fd53f8f9bac6b729af7cd3158 - 2

Droid deploy truss

The 'BASETEN_API_KEY' environment variable is not being set correctly in the 'truss_deploy.yml' workflow file. To fix this issue, you should set the 'BASETEN_API_KEY' environment variable in the 'env' section of the 'Run tests' step. Here is how you can do it:

   - name: Run tests
     env:
       BASETEN_API_KEY: ${{ secrets.BASETEN_API_KEY }}

This will ensure that the 'BASETEN_API_KEY' environment variable is correctly set and available to the commands in the 'Run tests' step of your GitHub Actions workflow.

factory-droid[bot] commented 2 months ago

[!IMPORTANT]

Solutions for GitHub Action Failures - 17f9d35a024c5d1648119c45e43785ac7a275c4f - 3

Droid deploy truss

The error you're encountering is due to an invalid or incorrect API key. Please follow these steps to resolve the issue:

  1. Navigate to your GitHub repository settings.
  2. Select 'Secrets' from the left sidebar.
  3. Find the 'BASETEN_API_KEY' secret and click on 'Update'.
  4. Enter the correct API key and click on 'Save changes'.

After updating the API key, re-run the GitHub Actions workflow. If the API key is correct, the workflow should run successfully.