toad.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server operated by David Troy, a tech pioneer and investigative journalist addressing threats to democracy. Thoughtful participation and discussion welcome.

Administered by:

Server stats:

274
active users

#datascience

27 posts24 participants0 posts today

Dr. Ellie Murray ScD @epiellie has been breaking down the MAHA report, piece by piece.

If you enjoy looking at bad data, or are interested in learning what is in the report and would rather read a review, take a look. I have been thoroughly enjoying it, as a data nerd.

Parts I & II are currently available:
epiellie.substack.com/p/the-ma

E is for Epi · The MAHA Report: I Read it so You Don't Have to.By Ellie Murray, ScD

«Workshop: Ας φτιάξουμε τη δική μας γλώσσα προγραμματισμού»
---
>> youtu.be/sIDj5dFSXWw (1ο μέρος)
>> youtu.be/U7GognkkUcU (2ο μέρος)
>> open.spotify.com/episode/5ndE6 (podcast)

Υλικό παρουσίασης:
* github.com/xgeorgio/ApneaCodin

Renku 2.0 has launched! The new release was rolled out this week and replaced the "legacy" platform. This is a major milestone and represents ~18 months of work - a huge effort by the whole team! We're excited to see how our users leverage this new modular design - check it out at renkulab.io! Join us tomorrow for the launch webinar to learn more.

Release blog post: blog.renkulab.io/launch-renku-
Webinar: blog.renkulab.io/renku-2-launc

renkulab.ioRenku | Collaborative Data Science | Open ResearchAn open-source platform for connecting data, code, and compute and empowering collaborative data science.
Replied in thread

💬 Want to use GPT-4, Claude, Gemini, Ollama & more directly from R?
Meet {ellmer}: a powerful wrapper to access a wide range of LLM providers via a unified interface.
Includes function/tool calling, structured output, image input & streaming!

📦 install.packages("ellmer")
📘 Docs: ellmer.tidyverse.org/
#rstats #LLM #AI #OpenSource #DataScience #RPackage #NLP

ellmer.tidyverse.orgChat with Large Language ModelsChat with large language models from a range of providers including Claude <https://claude.ai>, OpenAI <https://chatgpt.com>, and more. Supports streaming, asynchronous calls, tool calling, and structured data extraction.

Garbage in, garbage out – even Agentic AI can’t save you from yourself.

Artificial intelligence is only as brilliant as the data it’s spoon-fed – and spoiler alert: your data is often trash.
Whether it’s traditional machine learning, generative models, or your shiny new agentic systems, the pattern remains insultingly consistent:
• Bad data? Expect bad decisions.
• Incomplete data? Enjoy half-baked ideas.
• Outdated data? Say hello to irrelevant nonsense.

I often talk about what AI can or tragically still can’t do.
But here’s the real twist: the problem isn’t the system. It’s you. Or more specifically, the glorious mess you call your “data foundation.”

You don’t have a lack of innovation.
You have a lack of clean data structures, maintained knowledge bases, and basic contextual awareness.
And then you expect the AI to magically fill gaps that should never have existed in the first place.