← Back to data subTLDR

data subTLDR week 21 year 2025

r/MachineLearningr/dataengineeringr/SQL

Right Joins vs Left Joins Debate, Navigating the Shift from Data Analyst to Data Engineer, The Misuse of 'Real-Time' and 'AI-driven' in Tech

Week 21, 2025
Posted in r/dataengineeringbyu/SureResort64445/21/2025
977

when will they learn?

Meme
Many Reddit users express skepticism towards the use of terms such as real-time and AI-driven in tech and business settings, feeling they are often used inaccurately or as buzzwords. There's a shared insight that the term 'real-time' often leads to confusion and unexpected costs when its implications are not fully understood. Similarly, the idea of replacing analysts with 'AI-driven insights' is met with cynicism. Users also discuss the importance of clear communication and understanding the reasons behind requests for specific metrics, suggesting that failure to do so could lead to inefficiencies and wasted resources. Overall, the sentiment appears mixed to negative.
30 comments
Share
Save
View on Reddit →
Posted in r/dataengineeringbyu/HMZ_PBI5/22/2025
391

When i was a Data Analyst i enjoyed life, when i transitioned to Data Engineer i feel like i aged 10 years in a year

Discussion
The transition from Data Analyst to Data Engineer appears to be challenging and stressful for many, leading to a feeling of accelerated aging. The shift involves increased responsibilities, high-pressure situations, and often inadequate resources or support. The Data Engineers often serve as the bottleneck and scapegoat for data-related issues, dealing with constant change requests and difficult vendor APIs. However, some individuals thrive in the complex and nuanced environment, enjoying their role as a technical advisor across departments. Job satisfaction seems to greatly depend on personal interest in Data Engineering, managerial expectations, company culture, and the tools used. The lack of an on-call rotation is a major advantage for Data Analysts. Overall sentiment is mixed.
125 comments
Share
Save
View on Reddit →
Posted in r/MachineLearningbyu/hiskuu5/22/2025
251

[D] Google already out with a Text- Diffusion Model

Discussion
Google's release of Gemeni Diffusion, a new text-diffusion model, has sparked widespread discussion. The model is seen as a conceptual shift from traditional transformer-based language models, offering a whole and complete answer rather than a step-by-step construction. While some respondents are optimistic, suggesting that the model could be used for tasks such as text inpainting and code generation, others express concerns about potential issues with alignment and reasoning. However, the overall sentiment is of excitement for this innovative approach, which is viewed as a promising alternative to traditional autoregressive models. The model's potential to reshape thinking and expression in language learning models was also noted.
68 comments
Share
Save
View on Reddit →
Posted in r/dataengineeringbyu/Yoyo_Baggins5/23/2025
211

New data engineer getting paid more than me, a senior DE

Discussion
The thread reveals a common employment issue where a senior data engineer discovers a new hire with less experience earns more. Many suggest the senior employee should seek new job offers to leverage a better salary, reflecting a widely-held sentiment that companies lack loyalty to their staff, and that employees must protect their own interests. Several comments cite 'wage compression' as a real challenge, where internal wage growth lags behind market rates, suggesting that changing jobs often results in higher pay. This advice is tempered with caution not to leverage a colleague's salary in pay negotiations. Overall, the sentiment is mixed, with frustration at company practices but pragmatic advice for action.
139 comments
Share
Save
View on Reddit →
Posted in r/MachineLearningbyu/asankhs5/20/2025
194

[P] OpenEvolve: Open Source Implementation of DeepMind's AlphaEvolve System

Project
OpenEvolve, the open-source implementation of Google DeepMind's AlphaEvolve system, has successfully replicated two AlphaEvolve examples. The system evolves entire codebases using LLMs, enabling continuous program improvement for a variety of tasks. Positive reactions focused on the quick turnaround time since the AlphaEvolve announcement and the applicability of this move fast, break things approach to meaningful technological advancements. Interest was also shown in the system's performance, hardware requirements, and the prospects for further development. OpenEvolve's compatibility with any LLM via OpenAI-compatible APIs and its ability to use an ensemble of models for better results were also appreciated.
37 comments
Share
Save
View on Reddit →
Posted in r/SQLbyu/rataksh5/24/2025
117

One must imagine right join happy.

Discussion
There is a lively discussion around the necessity of right joins in SQL, in relation to left joins. Many believe that their function depends on perspective and convenience, with no inherent superiority. Some suggest that right joins may be more intuitive for those reading scripts right-to-left, like Urdu speakers. Others argue that it's about personal preference or habit, with no significant impact on results. A few participants highlight that both joins serve the purpose of merging data based on a common key, and the difference is only in the order of display. The sentiment is largely constructive and inquisitive.
40 comments
Share
Save
View on Reddit →
Posted in r/SQLbyu/ShuffleStepTap5/19/2025
114

How did I not know this?

SQL Server
The sentiment in the Reddit discussion is mixed towards the concept of manually editing top rows in data tables. Some users express excitement about the method, while others advise caution. The concern is the potential for exclusive lock on records during editing, urging the use of update scripts instead of manual editing through SQL Server Management Studio (SSMS) and to always have a rollback plan. Some seasoned professionals reflect that they haven't manually updated data in years, preferring SQL. Newer data analysts are encouraged to overcome fear of the UPDATE command, highlighting the importance of testing before executing in production.
30 comments
Share
Save
View on Reddit →
Posted in r/SQLbyu/AggravatingResist6355/20/2025
71

:)

Discussion
The dominant sentiment in this thread is cautionary, with the most upvoted comment advising against getting into the van, reflecting a sense of skepticism or wariness. A lower-scoring comment humorously points out that a certain website is for sale. Another comment expresses enthusiasm at the prospect of free candy, indicating some positive but possibly naïve anticipation. Overall, this mix of caution, humor, and anticipation suggests a lively but potentially wary discussion.
3 comments
Share
Save
View on Reddit →

Subscribe to data-subtldr

Get weekly summaries of top content from r/dataengineering, r/MachineLearning and more directly in your inbox.

Get the weekly data subTLDR in your inbox!

We respect your privacy. No spam, ever.