toad.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server operated by David Troy, a tech pioneer and investigative journalist addressing threats to democracy. Thoughtful participation and discussion welcome.

Administered by:

Server stats:

273
active users

#softwaredevelopment

16 posts16 participants2 posts today

How Accurately Do Large Language Models Understand Code?

arxiv.org/html/2504.04372v1

"This paper presents the first large-scale empirical investigation into the ability of LLMs to understand code. Inspired by mutation testing, we use an LLM’s ability to find faults as a proxy for its deep understanding of code. This approach is based on the insight that a model capable of identifying subtle functional discrepancies must understand the code well."

It appears that coding LLMs are vulnerable to misleading code comments, misleading variable names and misleading dead code. They still have shallow understanding of code, based on syntax and tokenization designed for natural languages, instead of analyzing code semantics. Writing a lot of incorrect comments can confuse them 😉

arxiv.orgHow Accurately Do Large Language Models Understand Code?

I must say I was involved about midway through my software development career in an early form of AI. At the time we called it rules-based system, and then expert system. It actually seemed intriguing and was custom-molded for the solution I was trying to architect at the time. It also was very constrained and didn't have the "ick" feel of the reach of AI nowadays.

Is there anyone from that timeframe that can share their experiences? I hope I'm not seeing it through rose-colored glasses.

en.wikipedia.org/wiki/Expert_s

en.wikipedia.orgExpert system - Wikipedia

TechCrunch: Automattic says it will start contributing to WordPress again after pause. “In a blog post titled ‘Returning to Core’ published Thursday evening, Automattic announced it will unpause its contributions to the WordPress project. This is despite having said only last month that the 6.8 WordPress release would be the final major release for all of 2025.”

https://rbfirehose.com/2025/06/04/techcrunch-automattic-says-it-will-start-contributing-to-wordpress-again-after-pause/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · TechCrunch: Automattic says it will start contributing to WordPress again after pause | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Are you struggling to retain new information?

Sleep plays a crucial role in transferring short-term memory to long-term memory, which is essential for learning new skills.

Make sure you get 7-9 hours of sleep to improve your memory and learning abilities.

Here are my top tips for better sleep?

👉 Earplugs
👉 A good quality mattress
👉 A well-fitting sleep mask
#developers #coding #softwaredevelopment #softwareengineering #wellbeing #mindset #mentalhealth

A key advantage of Python is its rich ecosystem of libraries. Beyond the powerful standard library, Python boasts a vast collection of third-party packages and frameworks that significantly boost productivity and simplify development across countless domains

Here are useful and popular third-party #python libraries 😎👇 #coding #programming #softwaredevelopment #softwareengineering

Find high-res pdf books with all my #DevOps related infographics at study-notes.org

Heard at #Devconf2025 (maybe paraphrased, I can't remember the exact words): remember, #VibeCoding was *not* intended for people to use to produce production-ready software; it was intended for developers to use to quickly play around with and learn new tech and new techniques, etc. And you were *certainly* never meant to just take what your favourite LLM spat out, paste it as-is into your IDE, run it and deploy it into a mission-critical application. You're supposed to tinker with it, understand it, modify it, use it to practice and learn.

And for that purpose, it is perfectly fit.

But it's the same thing with StackOverflow, right? How often do you copy code out of a SO answer and just use it as-is without giving a thought to how it actually works? And how many times, as a young developer, have your seniors told you that you should *never* do that? :P