Not Total Recall (1990)

ordellrb@lemmy.world · edit-2 9 months ago

Not Total Recall (1990)

R00bot@lemmy.blahaj.zone · 9 months ago

I can’t imagine it’d be that hard to write some code that does that using an existing AI model.

JackGreenEarth@lemm.ee · 9 months ago

You’re probably right.

not_amm@lemmy.ml · 9 months ago

I found a small command to run KDE Spectacle (screenshot software) with Tesseract so I can OCR a screenshot if I want to, I only had to install Tesseract and a main language, you could easily do the same with an API and/or a local AI.

MacN'Cheezus@lemmy.today · 9 months ago

Llava and Bakllava are two Ollama models than can not only extract text but also describe what’s happening on screen.

Using tesseract-ocr, as the other guy suggested, is probably simpler and less resource intensive though.