← Back to data subTLDR

data subTLDR week 29 year 2025

r/MachineLearningr/dataengineeringr/SQL

Simplifying Complex Code: The Balance of Efficiency vs Readability, Free Resources for SQL Learning, Microsoft's Regex Update: Boon or Bane?, Triumph Over Adversity: From Homelessness to Data Engineering Intern, HR and CEOs: The Misunderstood Dynamics in Tech Companies

Week 29, 2025
Posted in r/dataengineeringbyu/Parking_Anteater9437/14/2025
1868

I want to cry

Career
The sentiment is overwhelmingly positive surrounding the story of a once-homeless individual who secured a data engineering internship and received high praise from upper management. The individual's journey involved significant struggle, including working for free at small companies, but the recent feedback and extension of the internship until graduation is seen as deserved and promising. Many comments encouraged the individual not to undervalue themselves, expressing excitement for their success and future prospects. With the internship paying $30 an hour, there's optimism regarding a potential full-time offer post-graduation.
110 comments
Share
Save
View on Reddit →
Posted in r/dataengineeringbyu/hatsandcats7/18/2025
1483

Me IRL

Meme
The thread emphasizes the disconnect between HR, CEOs, and technical roles, with a humorous undertone. The highest upvoted comment questions the situation where HR is contacting another HR, suggesting a misunderstood role dynamic. There's a notable sentiment of frustration towards non-technical people in CEO positions, with several comments emphasizing the challenges faced by technical staff. A few comments also humorously exaggerate the isolation of data engineers, indicating the high-stress and demanding nature of the job. Overall, the thread reflects a mixed sentiment, combining humor with criticism of current workplace dynamics in tech companies.
32 comments
Share
Save
View on Reddit →
Posted in r/dataengineeringbyu/hatsandcats7/18/2025
704

My biggest question

Meme
The discussion revolved around the unexpected popularity of a niche software company, Astronomer, following a scandal involving its CEO. Many were surprised by the company's value and speculated that the publicity could be a strategic move for exposure. Some expressed skepticism about the estimated share price and potential for IPO. Others humorously noted the difficulty of explaining what the company does and the irony of turning scandal into increased engagement. Overall sentiment was mixed, with a sense of astonishment, skepticism, and humor prevailing.
43 comments
Share
Save
View on Reddit →
Posted in r/MachineLearningbyu/yuntiandeng7/19/2025
296

[R] NeuralOS: a generative OS entirely powered by neural networks

Research
There's a lot of excitement and skepticism surrounding NeuralOS, an experimental generative OS powered by neural networks. Supporters are captivated by the prospect of a fully customizable OS, envisioning a future where OS interfaces could become human-like avatars, and app-specific UIs could become obsolete. Critics, however, express concern about the OS's slow performance and its dependence on high-end hardware. Additionally, there's speculation about the potential of diffusion-based games replacing traditional ones. Despite divergent views, there's consensus about the innovative nature of the project, and curiosity about its future applications.
49 comments
Share
Save
View on Reddit →
Posted in r/SQLbyu/Various_Candidate3257/15/2025
290

Wrote a 5-layer nested CTE, boss said "can you simplify this?"

Discussion
The thread focuses on balancing code complexity and readability. The consensus leans towards favoring maintainability over efficiency, emphasizing that clear, simple code is preferable unless specific circumstances necessitate complexity. A top-rated comment warns about overcomplicating SQL queries, suggesting that simpler methods can often achieve the same results. Another popular anecdote highlights the pitfalls of ignoring readability for the sake of performance, resulting in avoidable issues. Other advice includes using the least power principle when coding and ensuring code correctness before optimizing. The sentiment is positive, with contributors offering constructive advice to improve work habits and code quality.
136 comments
Share
Save
View on Reddit →
Posted in r/MachineLearningbyu/berkusantonius7/14/2025
115

[P] Anyone interested in TinyML?

Project
The open-source project sklearn2c, which generates lightweight C code from trained scikit-learn models for use on microcontrollers, has been positively received. This tool is particularly useful for real-time machine learning inference in resource-constrained environments where a full Python environment is not available. There's interest in the project's future roadmap and potential support for additional models, like RandomForests. Some users suggest exploring WebAssembly support for broader applicability, or using Emscripten to compile the generated C code to WebAssembly. The overall sentiment is positive, with appreciation for the project's utility and potential.
9 comments
Share
Save
View on Reddit →
Posted in r/MachineLearningbyu/glorious__potato7/18/2025
99

[P] Understanding Muon: A Revolutionary Neural Network Optimizer

Project
Muon, a new neural network optimizer, enhances optimization by treating weight matrices as geometric objects, facilitating 35% faster training with 15% fewer tokens. This revolutionary approach is powered by the rare directions hypothesis and employs the Newton-Schulz method for orthogonalization. Despite these second-order characteristics, Muon is primarily a first-order optimizer. It has similarities with Natural Gradient and Shampoo, but lacks the latter's accumulation of second-order stats. However, there are limitations; it only works with 2D weights, somewhat narrowing its use-case. Some also debate the trade-offs involved and question whether its performance improvement is substantial enough to be deemed revolutionary. Overall, the sentiment is mixed-positive.
24 comments
Share
Save
View on Reddit →
Posted in r/SQLbyu/Inner_Feedback_40287/17/2025
94

Good SQL courses

BigQuery
The majority of Reddit users recommended DataLemur as a beneficial free resource for learning SQL, gaining enough knowledge for a career switch to analytics. It's also seen as a useful tool for practice. Other popular suggestions included Alex the Analyst's beginner-friendly YouTube videos, W3Schools for understanding SQL logic, and Codecademy and Mode Analytics as beginner-friendly platforms. Some users found Edu4Sure's content particularly helpful due to its real-world examples. DataCamp was mentioned as a good overview but criticized for excessive hand-holding. Some users also suggested books like Modern Database Management by Hoffer and various resources including LinkedIn Learning.
56 comments
Share
Save
View on Reddit →
Posted in r/SQLbyu/Straight_Waltz_95307/18/2025
88

Regexps are Coming to Town

SQL Server
Microsoft SQL Server introducing regular expression support has been met with mixed reactions. While some users express enthusiasm for the new feature, noting it will ease string work and match feature parity with other databases, others question its frequent necessity. Critics highlight the potential misuse of regex, which could slow down searches in large databases. Some also suggest that learning regex is often a cyclical process, with developers forgetting and relearning it as the need arises. Overall, the update appears to be seen as a valuable tool when used sparingly and appropriately, despite not being universally deemed essential.
80 comments
Share
Save
View on Reddit →

Subscribe to data-subtldr

Get weekly summaries of top content from r/dataengineering, r/MachineLearning and more directly in your inbox.

Get the weekly data subTLDR in your inbox!

We respect your privacy. No spam, ever.