An open-source chatbot architecture for voice, vision, and multimodal assistants that can run locally or remotely. If you run achatbot yourself, you can learn more; star and fork the project to contribute.
Add Molmo vision models (MolmoE-1B-0924, Molmo-7B-O-0924, Molmo-7B-D-0924) via transformers inference (generate) with a streamer for streaming output; an A100 40G GPU is needed to run them.
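The generate-with-streamer pattern mentioned above can be sketched roughly as follows. This is a minimal, pure-Python illustration and not achatbot's actual code: `QueueStreamer`, `stream_generate`, and `fake_generate` are hypothetical names, and `QueueStreamer` only stands in for the role that a streamer such as transformers' `TextIteratorStreamer` plays. The idea is that a blocking generate call runs in a background thread and pushes text into a queue while the caller iterates over chunks as they arrive.

```python
import queue
import threading
from typing import Callable, Iterator

_END = object()  # sentinel that marks the end of generation


class QueueStreamer:
    """Hypothetical stand-in for a text streamer.

    The producer (the generate call) uses put()/end();
    the consumer iterates over the streamer to receive chunks.
    """

    def __init__(self) -> None:
        self._queue: queue.Queue = queue.Queue()

    def put(self, text: str) -> None:
        self._queue.put(text)

    def end(self) -> None:
        self._queue.put(_END)

    def __iter__(self) -> Iterator[str]:
        while True:
            item = self._queue.get()  # blocks until the next chunk arrives
            if item is _END:
                return
            yield item


def stream_generate(generate_fn: Callable[[QueueStreamer], None]) -> Iterator[str]:
    """Run a blocking generate function in a thread and yield its chunks."""
    streamer = QueueStreamer()

    def worker() -> None:
        try:
            generate_fn(streamer)
        finally:
            streamer.end()  # always unblock the consumer, even on error

    threading.Thread(target=worker, daemon=True).start()
    yield from streamer


def fake_generate(streamer: QueueStreamer) -> None:
    """Assumption: replaces a real model's generate call for illustration."""
    for chunk in ["A ", "photo ", "of ", "a ", "cat."]:
        streamer.put(chunk)


if __name__ == "__main__":
    for piece in stream_generate(fake_generate):
        print(piece, end="", flush=True)
    print()
```

With a real model, `fake_generate` would be replaced by a function that calls the model's generate method and passes the streamer along; how Molmo's remote code accepts a streamer is not shown here and would need checking against the model card.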
feat:
notebook:
fix: