data subTLDR week 5 year 2026

r/MachineLearningr/dataengineeringr/SQL

Flappy Bird Meets SQL: A Joyous Abomination, The Unending SQL Struggles Even Experts Face, Evolving Expectations from Full Stack Builders, The Expanding Role of Data Engineers

February 1, 2026•Week 5, 2026

Posted in r/dataengineeringbyu/Thinker_Assignment•1/27/2026

468

Are you seeing this too?

Discussion

There's a consensus that the demand for full stack builders in data roles has increased, with companies expecting a single person to handle all aspects of data handling, from Big Data to DevOps and beyond. This shift is seen as a way to lower salaries, with some expressing frustration at the unrealistic expectations and potential for underpayment. However, some professionals have been able to adapt and thrive in this environment, embracing a wide range of skills and tasks. The sentiment in the discussion is mixed, with many feeling overwhelmed by the expectations, but others leveraging the situation to maintain job security.

64 comments

Save

View on Reddit →

Posted in r/MachineLearningbyu/DrXiaoZ•1/27/2026

439

[D] Some thoughts about an elephant in the room no one talks about

Discussion

Concerns are growing within the research community about the negative impact of prioritizing publication and social media mentorship over rigorous training and mentorship. Many believe that the current system encourages superficial optimization over deep understanding, leading to a decline in research quality and ethical standards. This issue is seen as pervasive, affecting both junior and senior researchers. The consensus is that the system's incentives are misaligned and need to be revised. Suggestions include slowing the publication pace, encouraging collaboration between research and business, and emphasizing team-based works over individual papers. The sentiment is one of urgency to reverse these trends before they become the norm.

104 comments

Save

View on Reddit →

Posted in r/dataengineeringbyu/FreshIntroduction120•1/28/2026

419

The Data Engineer Role is Being Asked to Do Way Too Much

Discussion

Data engineers feel overwhelmed by their rapidly expanding roles, which now encompass a range of duties from pipeline development to security. Many appreciate the variety and learning opportunities this provides, but express concerns about pay not keeping pace with skill requirements. Some see broadening responsibilities as job security and an antidote to monotony, but others voice frustration about the unrealistic expectations to master all areas. There's a consensus that the role is not suitable for juniors and requires reliance on others for areas outside one's expertise. The sentiment is generally mixed, with concerns about sustainability counterbalanced by the benefits of diverse challenges.

42 comments

Save

View on Reddit →

Posted in r/MachineLearningbyu/ade17_in•1/26/2026

214

Advice for PhD students in this Al slop paper era - I feel academia needs serious revisions! [D]

Discussion

Academic publishing, particularly in Machine Learning (ML), is facing criticism as AI-written papers and reviews flood conferences. Some PhD students find this trend concerning, stating that quality research is getting lost amidst mediocre studies. Others see it as a result of high demand for ML research, leading to an overload of low-quality work. The publishing bar is viewed as being both too low and too high, with resource-rich labs able to churn out incremental improvements, while others struggle. A potential solution could be to prioritize lower-tier and specialized conferences, or journals. This issue has sparked discussions around the future of academia, suggesting a return to human-centric research sharing and collaboration. The overall sentiment appears to be mixed, with concerns about the quality of research and the dilution of impactful work.

60 comments

Save

View on Reddit →

Posted in r/dataengineeringbyu/Thinker_Assignment•1/29/2026

213

With "full stack" coming to data, how should we adapt?

Discussion

The discussion centers around the evolution of data engineering roles towards a full stack approach, prompting mixed reactions. Many professionals voice concerns about the constant emergence of new tools and the expectation to master an increasingly broad skillset. Others assert that a solid foundation in data structures, OOP, and algorithms, along with expertise in SQL and understanding of distributed computing, can facilitate adaptation to any tool. Some see potential benefits in expanding their skills to include AI engineering, given the higher salaries. Yet, there's a call for clearer role definitions in the industry and appropriate compensation for the expanded scope of responsibilities.

101 comments

Save

View on Reddit →

Posted in r/SQLbyu/Low-Distance9808•1/27/2026

207

I built the Flappy Bird game using SQL only... Now I need Therapist

SQL Server

The creator of a Flappy Bird game developed entirely through SQL received mixed reactions. Many praised the project as a testament of sheer dedication and a fine line between genius and insanity. There was also a sense of amusement, with some characterizing the project as an abomination that nonetheless sparked joy. Others expressed concern for the creator's well-being, albeit humorously. Practical suggestions for improvement were also proposed, such as tweaking the sleep time in the loop to increase difficulty progressively. The overall sentiment was one of admiration, amusement, and mild concern.

33 comments

Save

View on Reddit →

Posted in r/SQLbyu/joins_and_coffee•1/26/2026

Even after years of SQL experience, what still trips you up the most?

Discussion

Despite years of experience, SQL users still face frequent struggles. Common issues include forgetting to use the 'group by' clause in queries with an aggregation (206 upvotes), misuse of 'LEFT JOINs' in a way that unintentionally converts them into 'INNER JOINs' (120 upvotes), and dealing with varying date/time formats (102 upvotes). Users also express annoyance with writing pivots (56 upvotes) and spelling 'coalesce' (36 upvotes). Performance tuning on legacy systems is a significant concern, especially when the schema cannot be controlled (35 upvotes). The thread's sentiment is of shared frustration and a desire for more efficient solutions.

130 comments

Save

View on Reddit →

Posted in r/SQLbyu/aleda145•1/29/2026

Oops it's a Drakanian Product

Discussion

The commenters in this thread primarily reacted with humor, referencing programmer-focused subreddits and making light of the content's lack of humor. There was a minor critique about the incomplete results, implying a technical issue. The overall sentiment was light-hearted and slightly critical. Despite the limited number of comments, there seems to be a consensus that the post didn't quite hit the mark in terms of expected humor or technical correctness.

4 comments

Save

View on Reddit →

Subscribe to data-subtldr

Get weekly summaries of top content from r/dataengineering, r/MachineLearning and more directly in your inbox.

data subTLDR week 5 year 2026

Subscribe to data-subtldr

Get the weekly data subTLDR in your inbox!