mezbaul-h / june

Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
MIT License
677 stars 39 forks source link

IndexError: index 47762 is out of bounds for dimension 0 with size 46028 #13

Closed ElhamAhmedian closed 4 days ago

ElhamAhmedian commented 3 weeks ago
[system]> Listening for sound...
[system]> Sound detected, starting recording...
[system]> Silence detected, stopping recording...
Traceback (most recent call last):
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\runpy.py", line 196, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\runpy.py", line 86, in _run_code
    exec(code, run_globals)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\Scripts\june-va.exe\__main__.py", line 7, in <module>
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\click\core.py", line 1157, in __call__
    return self.main(*args, **kwargs)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\click\core.py", line 1078, in main
    rv = self.invoke(ctx)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\click\core.py", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\click\core.py", line 783, in invoke
    return __callback(*args, **kwargs)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\june_va\cli.py", line 202, in main
    asyncio.run(_real_main(**kwargs))
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\asyncio\runners.py", line 44, in run
    return loop.run_until_complete(main)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\asyncio\base_events.py", line 649, in run_until_complete
    return future.result()
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\june_va\cli.py", line 109, in _real_main
    producer(text_queue, llm_model, stt_model)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\june_va\cli.py", line 248, in producer
    user_input = get_user_input()
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\june_va\cli.py", line 225, in get_user_input
    transcription = stt_model.forward(audio_data)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\june_va\models\stt.py", line 59, in forward
    transcription = self.model(audio, **self.generation_args)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\pipelines\automatic_speech_recognition.py", line 285, in __call__
    return super().__call__(inputs, **kwargs)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\pipelines\base.py", line 1234, in __call__
    return next(
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\pipelines\pt_utils.py", line 124, in __next__
    item = next(self.iterator)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\pipelines\pt_utils.py", line 269, in __next__
    processed = self.infer(next(self.iterator), **self.params)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\pipelines\base.py", line 1149, in forward
    model_outputs = self._forward(model_inputs, **forward_params)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\pipelines\automatic_speech_recognition.py", line 496, in _forward
    tokens = self.model.generate(
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\models\whisper\generation_whisper.py", line 577, in generate
    outputs = super().generate(
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\torch\utils\_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\generation\utils.py", line 1576, in generate
    result = self._greedy_search(
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\generation\utils.py", line 2507, in _greedy_search
    next_tokens_scores = logits_processor(input_ids, next_token_logits)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\generation\logits_process.py", line 98, in __call__
    scores = processor(input_ids, scores)
  File "C:\Users\elham\AppData\Local\Programs\Python\Python310\lib\site-packages\transformers\generation\logits_process.py", line 1733, in __call__
    suppress_token_mask = torch.isin(vocab_tensor, suppress_tokens)
IndexError: index 47762 is out of bounds for dimension 0 with size 46028
github-actions[bot] commented 1 week ago

This issue is stale because it has been open 15 days with no activity. Remove stale label or comment or this will be closed in 5 days.