Gene-Weaver / VoucherVision

Initiated by the University of Michigan Herbarium, VoucherVision harnesses the power of large language models (LLMs) to transform the transcription process of natural history specimen labels.
https://huggingface.co/spaces/phyloforfun/VoucherVision
GNU General Public License v3.0

Estimate API cost for OCR models #31

Open mickley opened 3 months ago

mickley commented 3 months ago

Currently, the costs of OCR API access, such as Google Vision, aren't included in the cost calculations, expense report, or cost estimates. They are inexpensive, but as LLM costs go down, they are becoming a larger portion of the total cost per specimen. Google Vision API now costs about 10x more per specimen than Gemini 1.5 Flash.
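
A rough sketch of what a combined estimate could look like, assuming a flat per-call OCR price and per-million-token LLM pricing. The names `SpecimenCost` and `estimate_specimen_cost` are hypothetical and not part of VoucherVision, and the prices in the usage example are placeholders, not quoted rates:

```python
from dataclasses import dataclass


@dataclass
class SpecimenCost:
    """Per-specimen cost split into OCR and LLM parts (USD)."""
    ocr_usd: float
    llm_usd: float

    @property
    def total_usd(self) -> float:
        return self.ocr_usd + self.llm_usd

    @property
    def ocr_share(self) -> float:
        # Fraction of the total spent on OCR; 0.0 when nothing was spent.
        return self.ocr_usd / self.total_usd if self.total_usd else 0.0


def estimate_specimen_cost(
    ocr_calls: int,
    ocr_usd_per_call: float,
    tokens_in: int,
    tokens_out: int,
    usd_per_1m_in: float,
    usd_per_1m_out: float,
) -> SpecimenCost:
    """Combine OCR and LLM costs for one specimen.

    All prices are caller-supplied assumptions; look up current rates
    from each provider before relying on the result.
    """
    ocr_usd = ocr_calls * ocr_usd_per_call
    llm_usd = (tokens_in * usd_per_1m_in + tokens_out * usd_per_1m_out) / 1_000_000
    return SpecimenCost(ocr_usd=ocr_usd, llm_usd=llm_usd)


if __name__ == "__main__":
    # Placeholder prices for illustration only.
    cost = estimate_specimen_cost(
        ocr_calls=1,
        ocr_usd_per_call=0.0015,
        tokens_in=1000,
        tokens_out=500,
        usd_per_1m_in=0.10,
        usd_per_1m_out=0.20,
    )
    print(f"total ${cost.total_usd:.5f}, OCR share {cost.ocr_share:.0%}")
```

Keeping the OCR and LLM parts separate would let the expense report show how much of each specimen's cost comes from OCR.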

Gene-Weaver commented 3 months ago

Well, that's good to know. I hadn't checked OCR prices in a while. I'll add this to issue #27.