toad.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server operated by David Troy, a tech pioneer and investigative journalist addressing threats to democracy. Thoughtful participation and discussion welcome.

Administered by:

Server stats:

267
active users

#dataengineering

1 post1 participant0 posts today
Gordon Inggs<p>Come work with us at the City of <a href="https://fosstodon.org/tags/CapeTown" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CapeTown</span></a> - <a href="https://uct.ac.za/sites/default/files/2025-08/com-e25810-datascientist-jpal-saldru-soe.pdf" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">uct.ac.za/sites/default/files/</span><span class="invisible">2025-08/com-e25810-datascientist-jpal-saldru-soe.pdf</span></a></p><p>The job says data scientist, but really this is a <a href="https://fosstodon.org/tags/CivicTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CivicTech</span></a> <a href="https://fosstodon.org/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> position. We run a decent Open Source, modern data stack, but what sets us apart is our great culture - family first, psychologically safe, curiosity encouraged and reward!</p><p>Would appreciate any boosts for reach!</p>
kamatahvel<p>Hi <a href="https://infosec.exchange/tags/GetFediHired" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GetFediHired</span></a>, I'm looking for a <a href="https://infosec.exchange/tags/remote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>remote</span></a> role in the US (or <a href="https://infosec.exchange/tags/sweden" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sweden</span></a> if you provide visa assistance!). </p><p>I've worked mostly in <a href="https://infosec.exchange/tags/SoftwareEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareEngineering</span></a> space, but I do lean closer to the <a href="https://infosec.exchange/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> side of things (past 3 years). Before that I was varying levels of doing SWE things inside a <a href="https://infosec.exchange/tags/BusinessIntelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BusinessIntelligence</span></a> role (~5 years).</p><p>Looking for something that demands strong <a href="https://infosec.exchange/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> skills (~5+ years of heavy, daily use), though wouldn't mind having to learn something new. Quite comfortable in a few <a href="https://infosec.exchange/tags/SQL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SQL</span></a> flavors. I can actually read most <a href="https://infosec.exchange/tags/regex" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>regex</span></a>, if that's a thing worth bragging about. Love writing <a href="https://infosec.exchange/tags/xpath" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>xpath</span></a> in personal webscraping projects. Somewhat familiar with <a href="https://infosec.exchange/tags/SpringBoot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpringBoot</span></a> and <a href="https://infosec.exchange/tags/Kotlin" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Kotlin</span></a> (1 year, occasional) and would like to eventually use more Java in work, but not a hard requirement. </p><p>I love refactoring/improving old code, and I have lots of experience with CI/CD, coding best practices, testing, web scraping, backend (Flask) &amp; frontend (React, Typescript). </p><p>Send me a message if this sounds like I'd be a great fit on your team!</p>
Posit<p>What makes tools truly useful? </p><p>Episode 2 of <a href="https://fosstodon.org/tags/TheTestSet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TheTestSet</span></a> features Wes McKinney (Part 1of 2!) sharing his experience building Pandas &amp; Arrow, plus his surprising past in speedrun communities.</p><p>Tune in for his story at thetestset.co, on Spotify, or Apple Podcasts</p><p><a href="https://fosstodon.org/tags/DataStack" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataStack</span></a> <a href="https://fosstodon.org/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://fosstodon.org/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://fosstodon.org/tags/Podcast" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Podcast</span></a> <a href="https://fosstodon.org/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a></p>
Posit<p>Ever wonder about the mind behind Pandas &amp; Apache Arrow? 🤔 Ep. 2 of <a href="https://fosstodon.org/tags/TheTestSet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TheTestSet</span></a> (Part 1!) unpacks Wes McKinney's journey – including his speedrunning past! What makes good tools good?</p><p>🎧 Listen at <a href="https://thetestset.co" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">thetestset.co</span><span class="invisible"></span></a>, on Spotify, or Apple Podcasts</p><p><a href="https://fosstodon.org/tags/DataStack" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataStack</span></a> <a href="https://fosstodon.org/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://fosstodon.org/tags/Pandas" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pandas</span></a> <a href="https://fosstodon.org/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://fosstodon.org/tags/PodcastLaunch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PodcastLaunch</span></a> <a href="https://fosstodon.org/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a></p>
Will Hopkins 🌈📸<p><a href="https://a2mi.social/tags/dataengineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>dataengineering</span></a> If you needed to use a data lake with Redshift, would you use Iceberg, given some native support, over Delta Lake, which is arguably a better format?</p><p>Asking for a friend who is me</p>
blaze.email<p>🔍 Excited about AXLearn for modular ML training, Pinterest's Moka for massive data processing, and PromiseTune for causal configuration tuning! <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> </p><p><a href="https://blaze.email/Machine-Learning-Engineer" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blaze.email/Machine-Learning-E</span><span class="invisible">ngineer</span></a></p>
Rami Krispin :unverified:<p>My weekly newsletter is out! 🚀</p><p>This week's agenda:<br>🔹 Open Source of the Week - The dagster project <br>🔹 New learning resources - Forecasting with linear regression, multi-model LLM, multiprocessing with Python<br>🔹 Book of the week - Visualization for Social Data Science by Roger Beecham</p><p>📌 Join 29k subscribers and subscribe to get weekly updates 🗞️👇🏼<br><a href="https://ramikrispin.substack.com/p/the-dagster-project-visualization" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">ramikrispin.substack.com/p/the</span><span class="invisible">-dagster-project-visualization</span></a></p><p><a href="https://mstdn.social/tags/DataScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataScience</span></a> <a href="https://mstdn.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://mstdn.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://mstdn.social/tags/RStats" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RStats</span></a> <a href="https://mstdn.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mstdn.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a></p>
⚯ Michel de Cryptadamus ⚯<p>pro tip for user interface designers:</p><p>if you have hundreds of millions of dollars of venture capital and you want to make a user facing data analytics tool of some kind and you think it's reasonable to ask an average human being to type this:</p><p> CAST('2023-05-01' AS TIMESTAMP)</p><p>to do literally anything with a date or time in your application's user interface, just stop right there. do not pass go, do not collect $200, and do not ever attempt to offer feedback to a UX designer ever again. something is deeply broken inside you that means there are certain mysteries of the universe that even the guys who designed the postgres command line can access that you will never know, and that's ok. You can still live a really rad life.</p><p><a href="https://universeodon.com/tags/SQL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SQL</span></a> <a href="https://universeodon.com/tags/dba" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>dba</span></a> <a href="https://universeodon.com/tags/dataengineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>dataengineering</span></a> <a href="https://universeodon.com/tags/postgres" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>postgres</span></a></p>
⚯ Michel de Cryptadamus ⚯scariest shit i've seen in years
GrowthBook<p>The problem: Setting up GA + BigQuery = 40+ manual steps, delayed insights, expensive queries</p><p>Our solution: </p><p>Managed Warehouse with:</p><p>One-click deployment<br>Real-time ClickHouse backend<br>Usage-based pricing<br>Built-in feature flag analytics</p><p>First 2M events/month free for Pro users. Raw SQL access maintained for power users.<br>Self-hosters: We're working on bringing Feature Usage Analytics to on-prem deployments too 👀<br><a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://mastodon.social/tags/ClickHouse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ClickHouse</span></a></p>
OS-SCI<p>Rust is transforming data engineering by offering unparalleled performance and cost efficiency. Singular's Extract platform, powered by Rust, achieves 17x performance improvements and up to 70% cost reductions. With memory safety and modern design, Rust is becoming the go-to for data-intensive workloads. Learn how Rust is outperforming Python and Java in enterprise data pipelines. <a href="https://mastodon.social/tags/RustLang" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RustLang</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://mastodon.social/tags/TechAdvancements" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechAdvancements</span></a> <a href="https://mastodon.social/tags/Programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Programming</span></a>"</p><p><a href="https://thenewstack.io/rust-eats-pythons-javas-lunch-in-data-engineering/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">thenewstack.io/rust-eats-pytho</span><span class="invisible">ns-javas-lunch-in-data-engineering/</span></a></p>
pipTrends<p>Pinpointing differences between two tables is very important for tasks like validating data migrations or spotting corruption. But when those tables live in different databases, it becomes tricky due to issues like network costs and different SQL dialects. In this article, Erez Shinnan shared how Reladiff tackles these challenges and its development journey.</p><p><a href="https://eshsoft.com/blog/how-reladiff-works" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">eshsoft.com/blog/how-reladiff-</span><span class="invisible">works</span></a></p><p><a href="https://mastodon.social/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a> <a href="https://mastodon.social/tags/Programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Programming</span></a> <a href="https://mastodon.social/tags/PythonProgramming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PythonProgramming</span></a> <a href="https://mastodon.social/tags/SoftwareDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareDevelopment</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a></p>
Lenin alevski 🕵️💻<p>New Open-Source Tool Spotlight 🚨🚨🚨</p><p>Transform any URL into an LLM-ready input with `Reader`. Just prefix the URL with `<a href="https://r.jina.ai/`" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">r.jina.ai/`</span><span class="invisible"></span></a> for clean, readable content extraction. Perfect for enhancing agents &amp; RAG pipelines. <a href="https://infosec.exchange/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://infosec.exchange/tags/NLP" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>NLP</span></a></p><p>Need web search results for your LLM? Prepend queries with `<a href="https://s.jina.ai/`" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">s.jina.ai/`</span><span class="invisible"></span></a> to fetch top results—content included. E.g., `<a href="https://s.jina.ai/your+query`" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">s.jina.ai/your+query`</span><span class="invisible"></span></a> brings knowledge directly to your model. <a href="https://infosec.exchange/tags/AItools" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AItools</span></a> <a href="https://infosec.exchange/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> </p><p>Reader API now supports images! Captions are auto-generated for images missing alt tags, giving LLMs better context for reasoning and summarizing multimedia pages. <a href="https://infosec.exchange/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://infosec.exchange/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a></p><p>🔗 Project link on <a href="https://infosec.exchange/tags/GitHub" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GitHub</span></a> 👉 <a href="https://github.com/jina-ai/reader" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/jina-ai/reader</span><span class="invisible"></span></a></p><p><a href="https://infosec.exchange/tags/Infosec" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Infosec</span></a> <a href="https://infosec.exchange/tags/Cybersecurity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cybersecurity</span></a> <a href="https://infosec.exchange/tags/Software" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Software</span></a> <a href="https://infosec.exchange/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://infosec.exchange/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a> <a href="https://infosec.exchange/tags/CTF" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CTF</span></a> <a href="https://infosec.exchange/tags/Cybersecuritycareer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cybersecuritycareer</span></a> <a href="https://infosec.exchange/tags/hacking" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hacking</span></a> <a href="https://infosec.exchange/tags/redteam" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>redteam</span></a> <a href="https://infosec.exchange/tags/blueteam" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blueteam</span></a> <a href="https://infosec.exchange/tags/purpleteam" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>purpleteam</span></a> <a href="https://infosec.exchange/tags/tips" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tips</span></a> <a href="https://infosec.exchange/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://infosec.exchange/tags/cloudsecurity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cloudsecurity</span></a></p><p>— ✨<br>🔐 P.S. Found this helpful? Tap Follow for more cybersecurity tips and insights! I share weekly content for professionals and people who want to get into cyber. Happy hacking 💻🏴‍☠️</p>
pipTrends<p>While indexes are useful, relying on them too much can be like Maslow's hammer. <span class="h-card" translate="no"><a href="https://mastodon.social/@treyhunner" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>treyhunner</span></a></span> has shown some fantastic alternative methods for common tasks without constantly needing to use indexes.</p><p><a href="https://www.pythonmorsels.com/avoid-indexes-in-python/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">pythonmorsels.com/avoid-indexe</span><span class="invisible">s-in-python/</span></a></p><p><a href="https://mastodon.social/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a> <a href="https://mastodon.social/tags/Programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Programming</span></a> <a href="https://mastodon.social/tags/PythonProgramming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PythonProgramming</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/ml" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ml</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/SoftwareDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareDevelopment</span></a> <a href="https://mastodon.social/tags/WebDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebDevelopment</span></a> <a href="https://mastodon.social/tags/TechNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechNews</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a></p>
pipTrends<p>The <span class="h-card" translate="no"><a href="https://mas.to/@huggingface" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>huggingface</span></a></span> team has created tiny-agents, a new feature that lets their huggingface_hub software act as a Model Context Protocol (MCP) Client. In their recent article, they explained how to set up these tiny agents to give new abilities to your LLMs to interact with the world and perform complex tasks.</p><p><a href="https://huggingface.co/blog/python-tiny-agents" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">huggingface.co/blog/python-tin</span><span class="invisible">y-agents</span></a></p><p><a href="https://mastodon.social/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a> <a href="https://mastodon.social/tags/Programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Programming</span></a> <a href="https://mastodon.social/tags/PythonProgramming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PythonProgramming</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/ml" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ml</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/SoftwareDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareDevelopment</span></a> <a href="https://mastodon.social/tags/WebDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebDevelopment</span></a> <a href="https://mastodon.social/tags/TechNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechNews</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a></p>
dealingwith<p>If anyone knows Data Engineers looking for work, this is our next hire: <a href="https://www.linkedin.com/posts/dealingwith_dataengineering-hiring-startuplife-activity-7338312558455476224-jvGh" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">linkedin.com/posts/dealingwith</span><span class="invisible">_dataengineering-hiring-startuplife-activity-7338312558455476224-jvGh</span></a></p><p><a href="https://billee.applytojob.com/apply/iTXqZOqOUu/Senior-Data-Engineer" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">billee.applytojob.com/apply/iT</span><span class="invisible">XqZOqOUu/Senior-Data-Engineer</span></a></p><p><a href="https://indieweb.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://indieweb.social/tags/hiring" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hiring</span></a> <a href="https://indieweb.social/tags/getfedihired" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>getfedihired</span></a> <a href="https://indieweb.social/tags/FediHire" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FediHire</span></a></p>
pipTrends<p>Last month, two new Rust-based Python type checkers, pyrefly and ty were released. Both of them are in the alpha stage. While they share some similarities, they differ significantly in design and features. In this article, Edward Li dove deep into both tools, highlighted their differences and what makes each one unique.</p><p><a href="https://blog.edward-li.com/tech/comparing-pyrefly-vs-ty/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.edward-li.com/tech/compar</span><span class="invisible">ing-pyrefly-vs-ty/</span></a></p><p><a href="https://mastodon.social/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a> <a href="https://mastodon.social/tags/Programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Programming</span></a> <a href="https://mastodon.social/tags/PythonProgramming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PythonProgramming</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/ml" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ml</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/SoftwareDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareDevelopment</span></a> <a href="https://mastodon.social/tags/WebDevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebDevelopment</span></a> <a href="https://mastodon.social/tags/TechNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TechNews</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a></p>
Mike Spencer<p>A great job with a fantastic group: <a href="https://www.dataorchard.org.uk/analytics-engineer-vacancy" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dataorchard.org.uk/analytics-e</span><span class="invisible">ngineer-vacancy</span></a></p><p><a href="https://mastodon.scot/tags/DataScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataScience</span></a> <a href="https://mastodon.scot/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://mastodon.scot/tags/RStats" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RStats</span></a> <a href="https://mastodon.scot/tags/JobFairy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>JobFairy</span></a> <a href="https://mastodon.scot/tags/FediHire" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FediHire</span></a> <span class="h-card" translate="no"><a href="https://data-folks.masto.host/@data_orchard" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>data_orchard</span></a></span></p>
Seán Fobbe<p>🔔 New Slides 🔔 </p><p>I've made the slides for my recent talk on "Legal Data Engineering" available <a href="https://fediscience.org/tags/OpenAccess" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAccess</span></a> </p><p>Slides: <a href="https://zenodo.org/records/15575231" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">zenodo.org/records/15575231</span><span class="invisible"></span></a> (in German) </p><p><a href="https://fediscience.org/tags/OpenAccess" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAccess</span></a> <a href="https://fediscience.org/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://fediscience.org/tags/Law" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Law</span></a></p>
Seán Fobbe<p>🔔 Slides zu Legal Data Engineering 🔔 </p><p>Was ist Legal Data Engineering? Wie sieht die Praxis juristischer Daten in Deutschland aus? Welche rechtlichen Probleme ergeben sich im Zusammenhang mit Legal Data Engineering? Diese Präsentation bietet eine Einführung zu Legal Data Engineering und sucht Antworten auf diese Fragen.</p><p>Slides: <a href="https://zenodo.org/records/15575231/files/Fobbe_2025-05-28_Legal-Data-Engineering.pdf?download=1" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">zenodo.org/records/15575231/fi</span><span class="invisible">les/Fobbe_2025-05-28_Legal-Data-Engineering.pdf?download=1</span></a></p><p>Legal Data Engineering ist der Schwerpunkt eines jeden Legal Data Science Projekts. Kern von Data Engineering ist der ETL-Prozess: Extraktion, Transformation und das (Hoch-)Laden von Daten. Die Slides bieten dazu einen allgemeinverständlichen Überblick.</p><p>Weitere praktische Themen sind die Verfügbarkeit juristischer Daten in Deutschland (insbesondere strukturierter Daten und Programmierschnittstellen), Probleme bei der Tokenisierung in Large Language Models und die Fehlerkennung von Gen-Namen in Microsoft Excel.</p><p>Bei den rechtlichen Fragen des Legal Data Engineering behandle ich die tradierte Rechtslage, das neue Datennutzungsgesetz (DNG) und Bayern als Negativbeispiel einer verschlossenen juristischen Datenkultur. Eine Diskussion der Datenschutzklage gegen OpenJur und der Open Data-Klage der Gesellschaft für Freiheitsrechte (GFF) gegen die Bundespolizei klären über aktuelle Entwicklungen in diesem Rechtsbereich auf.</p><p><a href="https://fediscience.org/tags/DataEngineering" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataEngineering</span></a> <a href="https://fediscience.org/tags/OpenAccess" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAccess</span></a> <a href="https://fediscience.org/tags/OpenScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenScience</span></a> <a href="https://fediscience.org/tags/Law" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Law</span></a> <a href="https://fediscience.org/tags/ETL" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ETL</span></a> <a href="https://fediscience.org/tags/Datennutzungsgesetz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Datennutzungsgesetz</span></a> <a href="https://fediscience.org/tags/Pipeline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pipeline</span></a> <a href="https://fediscience.org/tags/Data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Data</span></a></p>