nelsonic commented 9 months ago

@LuchoTurtle you've done a superb job of building a fully functional image captioning app! 😍 🎉 Now it's time to get some credit for it 💳 😉 by submitting a "Show HN" :shipit: And in the process raise your public profile for applying to #NextAdventure 🚀

Todo

[x] Address the start-up time (currently 25sec): https://github.com/dwyl/image-classifier/issues/20
- [ ] And/or create an endpoint to serve a 1x1px GIF to start the app proactively when someone is reading the README.md ref: https://github.com/dwyl/ping/issues/7
[x] Improve the Why? section of README.md to answer the Reader's question succinctly.

Currently it's: https://github.com/dwyl/image-classifier/tree/a3c8c3bf79a0f5e7bca160b6e460d883bc6e3973#why-

This is good for an internally-focussed tutorial for us ... 👍 but as an LLM-curious person casually reading on HN, 👀 this isn't going to "hook" me into reading a 5k word tutorial for 30+ mins ... ⏳

# Why? 💭 

We needed a fully-offline capable (no 3rd party APIs/Services) image captioning service 
using a state-of-the-art pre-trained image model to describe images uploaded in our 
[`App`](https://github.com/dwyl/app).

# What?

A step-by-step tutorial building a fully functional 
`Phoenix LiveView` web application that allows anyone 
to upload an image and have it described 
by the `Open Source` `BLIP` image captioning (`Large`) model.

[x] Simplify layout of homepage of the App and add examples: https://github.com/dwyl/image-classifier/issues/7
[x] Prepare Intro/Demo GIF: https://github.com/dwyl/image-classifier/issues/21
[x] Add demo GIF to intro of README.md
- [x] GIF should be below the badges and above the TOC.

[ ] Record Tutorial Video: #23

Part 2

[ ] Study the 20 most successful "Show HN" posts: https://hn.algolia.com/?query=show+hn&sort=byPopularity (spend 1-2 mins skimming each one to extract the info - or use ChatGPT to do some pre-analysis if you think it will be useful ...) Summarise what you learned as comments in this issue thread. 💬 🙏
[ ] Carefully craft your Show HN Title to maximise its chance of success. 💭
[ ] Share on https://elixirforum.com/ to get some initial traction/stars.
- [ ] As soon as you submit on HN, update the Elixir Forum topic to include the HN link to drive up-votes.
[ ] Submit your Show HN!! 🚀
[ ] Share the link so we can up-vote it. 🔗 ⬆️
- [ ] Get everyone you know to up-vote it so it gets an initial boost. 📈

nelsonic commented 8 months ago

@LuchoTurtle how close do you feel this repo is to sharing on HN? 💭 🚀

LuchoTurtle commented 8 months ago

Save for some changes to the README (as outlined in this issue), I think it brings sufficient value for those who want to get started with Bumblebee (unfortunately, there aren't many examples out there with in-depth guides and comparisons).

However, I think #18 would also bring great value to this project and would be extremely interesting as it would use a part of Bumblebee that is not used here - voice-to-text.

ndrean commented 8 months ago

I can push several versions if you want help. 1) add an audio capture (an HTML

LuchoTurtle commented 8 months ago

@ndrean any PR is helpful :). I think your idea of speech-to-text really takes this to another level. The purpose is to document the process, which is severely lacking in fly.io articles and in the bumblebee repo.

ndrean commented 8 months ago

@LuchoTurtle I understand your idea. Note that selecting the right model is probably the most difficult part. However, I made no effort on this. I just followed and adapted an article by Sean Moriarty on semantic search with ExFaiss. It just works, and I feel I did not really learn anything - except the Elixir point of view of using tasks - in the sense that if you ask for something more difficult, like say an interactive LLM or building a model, then I clearly have no clue what to do. It's more like a one-shot. But you have to start somewhere, don't you!? Anyway, I always cite my sources and will push things as soon as my computer is available.

dwyl / image-classifier

EPIC: Prepare the repo for `public` share on `HN` 🚀 #22
