Open musunita opened 2 months ago
cc @michaelbenayoun
Thank for the fix. I tried the fix and I dont see above errors anymore which is great. But I encounter a new error now as below.
No input shapes provided, using default shapes, {'batch_size': 1, 'sequence_length': 128} ___ test_export_no_parameters[feature-extraction-hf-internal-testing/tiny-random-XLMModel] ___
self = <optimum.neuron.utils.hub_neuronx_cache.CompileCacheHfProxy object at 0x7f0d8009f850>, repo_id = 'optimum-internal-testing/optimum-neuron-cache-for-testing' default_cache = <libneuronxla.neuron_cc_cache.CompileCacheFs object at 0x7f0f393d7c70>, endpoint = None, token = None
def __init__(
self, repo_id: str, default_cache: CompileCache, endpoint: Optional[str] = None, token: Optional[str] = None
):
# Initialize the proxy cache as expected by the parent class
super().__init__(default_cache.cache_url)
self.cache_path = default_cache.cache_path
# Initialize specific members
self.default_cache = default_cache
self.api = HfApi(endpoint=endpoint, token=token, library_name="optimum-neuron", library_version=__version__)
# Check if the HF cache id is valid
try:
if not self.api.repo_exists(repo_id):
raise ValueError(f"The {repo_id} repository does not exist or you don't have access to it.")
E ValueError: The optimum-internal-testing/optimum-neuron-cache-for-testing repository does not exist or you don't have access to it.
optimum/neuron/utils/hub_neuronx_cache.py:116: ValueError
You need to set the CUSTOM_CACHE_REPO
environment variable with a Hub repo to use as a cache repo and then set the HF_TOKEN
environment variable with a token that has write access to the repo.
Thanks Michael. There are around 35 outstanding failures. Attached logs here. [Uploading outinf2.txt…]()
Would you please recommend next steps on the failures.
System Info
Who can help?
@dacorvo, @JingyaHuang,
Inference test details is enclosed in tests in optimum-neuron.pdf. Tests in Optimum Neuron (3) (4).pdf
While running inference tests on INf2.48xlarge, encounter following errors.
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction (minimal, reproducible, runnable)
steps: Run tests a. pytest -m is_inferentia_test tests
Expected behavior
Expect to see inference suite passing.