I can successfully execute the query when waiting for the full response, but once I enable the streaming flag it just starts throwing exceptions
response = query_engine.query(compiledQuery)
for token in response.response_gen:
print(token)`
TypeError: 'NoneType' object is not iterable
I have a tried a number of different ways to get streaming to work, and from what I can see in the RTX Chat codebase, this is what they are doing, but it is not working for me, with the above error
I am attempting to build a chatbot using TrtLlmAPI as the llm
and a query retrieve engine to perform the query
I can successfully execute the query when waiting for the full response, but once I enable the streaming flag it just starts throwing exceptions
TypeError: 'NoneType' object is not iterable
I have a tried a number of different ways to get streaming to work, and from what I can see in the RTX Chat codebase, this is what they are doing, but it is not working for me, with the above error