- cross-posted to:
- programmer_humor@programming.dev
- cross-posted to:
- programmer_humor@programming.dev
Yes it’s an LLM called pandoc, you can run it locally
You don’t need a private nuclear plant to run it? Wow very efficient.
Black magic software.
Learn more: openai.com/pandoc
Technically OCR is an application of machine learning.
Not an LLM, though.
regularizing the OCR’d form into a json/html file might be a good application of an LLM though. Perhaps this is what they were asking about.
I doubt they even know what they are asking about?
The secret to success in software engineering:
- Lie and say that there is
- Write or use a conversion algorithm
- Boss won’t know the difference
- Collect bonus at performance evaluation
- Put “AI engineer” on resume
- Boss thinks AI can code at senior developer level and fires you and the entire team
- Never plan on staying at a SE job for longer than a few years. Not in a market that volitile.
No need, there’s an unmaintained javascript library for that (written by a 12-yr old)
Omg, sign me up! I’m gonna put that script in production for a server used by millions of customers around the world!
Oh no, now there is a security audit and the pdf generated is insecure, the unpaid developer that has not logged in since 2015 has to fix this ASAP
This is that special blend of Tablet Kid “I don’t need to know things I can google them” and Rich Kid “I don’t need to do things I can crowdsource them” that makes for that Distinctively VP “I don’t know what I’m doing and nobody can tell 👈😎👉”
Initially, I didn’t think these kids were fall guys.
Now I think they’re fall guys.
That was my thought. Young kids fresh out of school are really easy to manipulate into delusions of grandeur, especially when said delusions are offered by the richest person in the world. He’s gonna leave them out for the wolves.
yes me send me what you want me to parse and i will get back to you in 3-4 business days
I have to admit, PDF parsing being such a hot and profitable topic in computer science was really something I never saw coming.
PDFs? The things you can select text from? And when not, there’s decent OCR? And when not, you just ask the person to send you an email or a word doc?
It sounds like LLMs are looking for a new unpolluted source of historical data that they can learn from, and this source exists in the form of old scanned-in paper documents. That’s the only reason I can fathom as to why this is such a big thing now.
How is that a stupid question?
Large Language Models are for natural language processing, not for converting between text document file formats.
Perhaps the best way would be through an analogy:
“Are there any thermonuclear bombs made specifically for lighting candles?”
Imagine getting a job like this and now half the nation knows your name…thats terrifying. being an intern may mean you have no idea of the true scope of what they are asking you to do.
They are public employees who are changing things at the core of our government. Why wouldn’t we know their names?
Government employees names aren’t secret (asides from a few exceptions) nor is their pay
We know that his dad is an engineering professor at university of Nebraska too. Really calls into question his credentials. I checked the other day and they had already removed his contact info from their website.
Yeah, seems that’s the point. Old enough to competently perform what they’re told, but too young to realize the gravity of the situation and how wrong it is to partake in it.
that’s why we have 18 year soldiers …
It’s ok, with the experienced gained from being forced to grow up, some will come home and use their savings to buy a dodge ram on a 7 year loan at 18% apr.