ivanafterall ☑️

  • 14 Posts
  • 654 Comments
Joined 2 years ago
cake
Cake day: June 20th, 2023

help-circle








  • Thanks! I got hired at a pretty big university (better than the one I attended for sure) to do documentation (and hopefully, increasingly data) work in their advancement department. Kind of random and the pay could definitely be better, but I’m generally pretty happy with the environment. Nice to not be supporting abject evil. First actual work-from-home job without feeling a suspicious eye on me at all times. Trying to make use of the free certification courses they offer and am halfway through CompTIA Data+. Nice break from the Uber/Lyft grind for awhile, anyway.


  • I don’t have a specific figure for you. My use-case is I’m trying to write a non-fiction book. I’ve got a ton of old newspaper articles in PDF format. The Library of Congress’ built-in OCR is very helpful, but very lacking and, in some cases, can miss large swaths of pages or generate really unhelpful gibberish that requires painful cleaning. I’ve had similar results from every other OCR tool I’ve tried.

    Thus far, in using Claude/ChatGPT for transcription of a few dozen articles, I’ve only had to fix one individual stray word a few times. It’s been very close to perfect in my limited testing. High 90%. Impressively, with old newspaper articles where words have worn away or are otherwise very hard to make out even for me, it has done a great job of inferring/recognizing, where OCR would start generating gibberish. I haven’t tried hand-writing and suspect that’s a different beast, but I know there are tools that have cropped up to that end.