data subTLDR week 9 year 2026

r/MachineLearningr/dataengineeringr/SQL

Future of SQL Developer Jobs in 2026, Challenges with Legacy Apps' Relational Diagrams, Mixed Reactions to AI Tools in Data Engineering, Controversy Surrounding DE Academy's Job Guarantee Program

March 1, 2026•Week 9, 2026

Posted in r/MachineLearningbyu/Healthy_Horse_2183•2/23/2026

757

[D] Is Conference prestige slowing reducing?

Discussion

The surge in papers accepted at prestigious conferences like CVPR and ICLR has led to a debate on whether the prestige of such events is waning. There's a consensus that the sheer volume of accepted papers poses a challenge to peer review quality and information absorption. Critics argue that the system is unable to handle the upscaling and is letting false results slip through. However, the prestige of these conferences remains high due to the desirability to publish and attend. Some suggest that crowd review, evidenced by GitHub stars or integration into major frameworks, is becoming a more reliable quality indicator than peer review.

63 comments

Save

View on Reddit →

Posted in r/dataengineeringbyu/KindTeaching3250•2/24/2026

323

Am I missing something with all this "agent" hype?

Discussion

AI tools are being widely discussed in data engineering fields, with mixed views on their effectiveness. The most agreed-upon sentiment is that this may be shiny-tool syndrome, with many people excited about the novelty without significant productivity gains. However, some users find these tools beneficial when integrated into well-documented, organized systems. Others stress that these tools can't replace processes requiring review or technical expertise. Some have used AI tools successfully for tedious, time-consuming tasks. A few have experienced dramatic productivity boosts from AI tools, leading to existential concerns about job displacement. The overall sentiment is mixed, with both skepticism and excitement.

128 comments

Save

View on Reddit →

Posted in r/MachineLearningbyu/osamabinpwnn•2/24/2026

199

[D] Papers with no code

Discussion

Frustration is widespread over the lack of code provided in machine learning research papers, impeding the ability to verify claims or reproduce results. Many agree that even when code is provided, it's often difficult to get it working or it is poorly written and hard to understand. There's a call for better standards in code sharing, including possible requirements for Docker or flake.nix files and standard deviations across multiple runs with different seeds. Some commenters argue that the absence of code undermines the scientific credibility of the field, pointing to a reproducibility crisis. The sentiment is largely negative, with calls for change to ensure research integrity.

94 comments

Save

View on Reddit →

Posted in r/MachineLearningbyu/ofmkingsz•2/25/2026

179

[D] ML Engineers — How did you actually learn PyTorch? I keep forgetting everything.

Discussion

Long-term retention of PyTorch skills hinges on continuous use in projects and regular reference to documentation, even for experienced ML Engineers. Relying on AI to code can expedite processes but shouldn't replace understanding PyTorch, as debugging AI-generated code requires this knowledge. Memorizing PyTorch behaviors isn't as crucial as statistics and machine learning knowledge in the current job market. However, creating datasets, dataloaders, and training loops often requires more memorization. Overall, the sentiment is positive towards continuous learning and adapting in a rapidly developing field.

60 comments

Save

View on Reddit →

Posted in r/dataengineeringbyu/Hour-Yesterday-6197•2/24/2026

174

My experience with DE Academy’s “job guarantee” program (1-year review)

Career

The DE Academy's job guarantee program has come under scrutiny, with a user sharing a negative experience of not securing a job despite a year's participation, and being denied a refund. The post sparked a wave of agreement, suggesting widespread dissatisfaction with the program. Commenters criticized the academy's leadership, alleged fraudulent activities, and the program's high cost. There was also frustration about the denial of refunds based on contract technicalities and concerns about the academy's communication practices. The overall sentiment was negative, advising potential participants to steer clear of the academy and seek self-learning or other avenues.

47 comments

Save

View on Reddit →

Posted in r/dataengineeringbyu/arimbr•2/27/2026

160

Which data quality tool do you use?

Personal Project Showcase

Data professionals are expressing a preference for using simple, cost-effective methods for maintaining data quality over specialized tools. The high cost and potential limitations of these tools are seen as deterrents. Many prefer to build their own systems or use basic, easily accessible tools like Python or SQL queries. Others rely on user feedback to identify issues. Despite this, some find value in specialized tools for very specific tasks such as anomaly detection or diff-ing in-development models against production tables. The overall sentiment leans towards a do-it-yourself approach to data quality, with a focus on efficacy rather than using the latest equipment.

56 comments

Save

View on Reddit →

Posted in r/SQLbyu/Interesting-Park2465•2/24/2026

Is SQL Developer jobs still alive in 2026?

Discussion

The consensus among professionals is that SQL developer roles will remain relevant and in demand, even by 2026. The job offers continuous learning opportunities, with areas like performance, indexing, query plans, and other SQL related tasks. There's also potential to expand into Business Intelligence and data warehousing. However, it's advisable to diversify one's skills, combining SQL with business acumen and other programming languages (like Python or Java) to bridge technical and business/domain knowledge. The hybrid nature of the role offers a good career opportunity, especially in regions like West Europe, where SQL developers are sought after.

82 comments

Save

View on Reddit →

Posted in r/SQLbyu/Shikitsumi-chan•2/24/2026

These are the relational diagrams of my company’s 16-year-old app

MySQL

The lack of foreign key constraints and referential integrity in a company's 16-year-old app has been causing difficulty in writing relational queries. This, however, is not uncommon in legacy apps, according to the majority of commentators. They suggest that referential integrity is often built into the app writing to the database, rather than the database itself. This approach is often seen as a way to optimize speed over data integrity. To help with the issue, it was suggested to query system 'INFORMATION_SCHEMA' tables to identify potential naming conventions. Overall, the sentiment was understanding but advised that such practices can lead to long-term difficulties.

37 comments

Save

View on Reddit →

Posted in r/SQLbyu/Money-Fan-2587•2/28/2026

How many Sql server DBA’s are currently laid off?

SQL Server

The overall sentiment in the discussion suggests a significant shift in the job market for SQL Server DBAs, with many experienced professionals struggling to find work. This is attributed to an oversaturated market, and a shift toward cloud-based services and automation reducing demand for traditional roles. However, some DBAs who have diversified their skill set, especially into cloud technologies, report a steady demand for their services. The job hunt duration varies greatly, with some individuals finding work relatively quickly, while others have been searching for several months. The job market's competitiveness is highlighted with thousands reportedly applying for single postings.

38 comments

Save

View on Reddit →

Subscribe to data-subtldr

Get weekly summaries of top content from r/dataengineering, r/MachineLearning and more directly in your inbox.

data subTLDR week 9 year 2026

Subscribe to data-subtldr

Get the weekly data subTLDR in your inbox!