Movatterモバイル変換


[0]ホーム

URL:


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
 
It seems that you've previously unsubscribed from our newsletterin the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to ourTerms of Use andPrivacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
 
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become aTNS follower on LinkedIn.

Check outthe latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
 
NEW! Try Stackie AI
PODCASTSEBOOKSEVENTSWEBINARSNEWSLETTERCONTRIBUTE
ARCHITECTUREENGINEERINGOPERATIONSPROGRAMMING
Throwing AI at Developers Won’t Fix Their Problems
Jul 11th 2025 4:00am, byChris Westerhold and Ankit Jain
Will LLMs and Vibe Coding Fuel a Developer Renaissance?
Jul 9th 2025 1:52pm, bySylvain Kalache
Introduction to Large Language Models (LLMs)
Jun 30th 2025 11:00am, byTNS Staff
How To Build an AI Health Care Agent on Amazon Bedrock
Jul 17th 2025 8:00am, byHafiz Hassan
Harnessing AI To Elevate Automated Software Testing
Jul 9th 2025 9:00am, byLindsay Farny
Driving Platform Adoption: The Missed Opportunity of Marketing
Jul 7th 2025 9:00am, byMichael Coté and Rita Manachi
MCP Vulnerability Exposes the AI Untrusted Code Crisis
Jul 7th 2025 6:00am, byDan Fernandez
Hypercloud Players Reveal Their Latest AI FinOps Products
Jun 9th 2025 11:28am, bySteven J. Vaughan-Nichols
Why Everyone Hates Your DevOps Stack
Jul 16th 2025 7:30am, byVicki Walker
Driving Platform Adoption, Part 2: Yes, Create T-Shirts To Tout Your Brand
Jul 15th 2025 8:00am, byMichael Coté and Rita Manachi
Let AI Hustle So Employees Can Lead
Jul 7th 2025 10:00am, byMike Szilagyi
Driving Platform Adoption: The Missed Opportunity of Marketing
Jul 7th 2025 9:00am, byMichael Coté and Rita Manachi
Kubernetes Finally Solves Its Biggest Problem: Managing Databases
Jul 14th 2025 1:00pm, byRotem Tamir and Gabriele Bartolini
6 Design Principles for Edge Computing Systems
Jul 17th 2025 7:00am, byJoab Jackson
Smarter AI for Critical Operations: Why Data Matters 
Jul 15th 2025 10:00am, byScott Bastek
The Grep Command in Linux
Jul 13th 2025 7:00am, byJack Wallen
Driving Platform Adoption, Part 2: Yes, Create T-Shirts To Tout Your Brand
Jul 15th 2025 8:00am, byMichael Coté and Rita Manachi
Enterprise Platform Teams Are Stuck in Day 2 Hell
Jul 14th 2025 12:03pm, byJennifer Riggins
Throwing AI at Developers Won’t Fix Their Problems
Jul 11th 2025 4:00am, byChris Westerhold and Ankit Jain
How AI Can Help You Learn the Art of Programming
Jun 20th 2025 9:00am, byJack Wallen
How to Build Multistep Forms in React
May 22nd 2025 12:00pm, byVinod Pal
Bjarne Stroustrup on How He Sees C++ Evolving
Mar 7th 2025 6:00am, byDavid Cassel
Enterprise Platform Teams Are Stuck in Day 2 Hell
Jul 14th 2025 12:03pm, byJennifer Riggins
Warp Goes Agentic: A Developer Walk-Through of Warp 2.0
Jul 12th 2025 6:05am, byDavid Eastman
10 Insights from Integrating AI into My Coding Workflow
Jul 11th 2025 5:05am, byDieter Randolph
Prepare Your Mac for Go Development
Apr 12th 2025 7:00am, byDamon M. Garn
Pagoda: A Web Development Starter Kit for Go Programmers
Mar 19th 2025 6:10am, byLoraine Lawson
Go Power: Microsoft's Bold Bet on Faster TypeScript Tools
Mar 12th 2025 1:00pm, byDarryl K. Taft and Loraine Lawson
Using Java for Developing Agentic AI Apps
Jul 1st 2025 8:00am, byDaniel Oh
Inside Java's Language Renaissance
Jun 26th 2025 10:00am, byChad Arimura
Async Programming in Java Repositories
Jun 13th 2025 1:00pm, bySimon Ritter
Java at 30: Java Pioneers Look Back, Forward
Jun 5th 2025 4:00pm, byDarryl K. Taft
Web Dev Kent Dodds on Remix and His Frontend Secret Weapon
Jul 15th 2025 12:30pm, byLoraine Lawson
Introduction to JavaScript
Jul 14th 2025 2:00pm, byTNS Staff
Hono Shows the Way for Microframeworks in a Post-React World
Jul 12th 2025 10:20am, byRichard MacManus
Creators of Nuxt.js and Nitro Join Vercel
Jul 11th 2025 10:05am, byLoraine Lawson
5 Underappreciated JavaScript Libraries To Try in 2025
Jul 10th 2025 12:00pm, byAlexander T. Williams
Python `apply()` vs. `apply_async()`: Which Should You Use?
Jul 16th 2025 10:00am, byJessica Wachtel
Convert Timestamps To Strings Like a Python Pro
Jul 14th 2025 11:00am, byJessica Wachtel
Python Indexing vs. For Loops: What’s Really Faster?
Jul 10th 2025 10:00am, byJessica Wachtel
How to Write Rust Code Like a Rustacean
Jul 9th 2025 6:05am, byZziwa Raymond Ian
Decode Any Python Code With This 5-Step Method
Jul 7th 2025 11:00am, byJessica Wachtel
Python `apply()` vs. `apply_async()`: Which Should You Use?
Jul 16th 2025 10:00am, byJessica Wachtel
Convert Timestamps To Strings Like a Python Pro
Jul 14th 2025 11:00am, byJessica Wachtel
Python Indexing vs. For Loops: What’s Really Faster?
Jul 10th 2025 10:00am, byJessica Wachtel
Decode Any Python Code With This 5-Step Method
Jul 7th 2025 11:00am, byJessica Wachtel
Build Your Own AI Terminal Chatbot in Minutes
Jul 3rd 2025 12:00pm, byJessica Wachtel
How to Write Rust Code Like a Rustacean
Jul 9th 2025 6:05am, byZziwa Raymond Ian
Do Frontend Frameworks Still Matter? A Thread
Jun 28th 2025 6:00am, byLoraine Lawson
Rust Eats Python's, Java’s Lunch in Data Engineering
Jun 19th 2025 1:00pm, byDarryl K. Taft
InfluxDB v3: Why Rust Beat Go for Time Series Database
Jun 5th 2025 10:00am, byB. Cameron Gain
2025-06-21 09:00:58
New OLTP: Postgres With Separate Compute and Storage
Databases

New OLTP: Postgres With Separate Compute and Storage

Is Databricks onto something by focusing on streaming data by treating the data like developers have treated code?
Jun 21st, 2025 9:00am by
Featued image for: New OLTP: Postgres With Separate Compute and Storage
Featured image by Bernd Dittrich from Unsplash.

Areonline transaction processing (OLTP) databases stuck in the past?

Well, OLTP databases are tightly coupled, saidDatabricks co-founderReynold Xin in his keynote earlier this month at the DatabricksData + AI Summit in San Francisco. Such databases are monolithic, combining compute and storage in big machines, which leads to various problems, including over-provisioning, scaling challenges, performance issues and a range of system complexities.

In the newLakebase product from Databricks, compute and storage are separated. Lakebase is loosely coupled, which opens up opportunities to rethink transactional databases and the use ofagentic AI to accomplish what traditional OLTP databases can’t.

“If you look at OLTP databases you’re running today, whether it’s commercials or proprietary systems likeOracle or open source databases like MySQL, Postgres, they look more or less the same as they were in the ’90s,” Xin said at the Databricks event.

Viewed asheavyweight infrastructure that requires manual intervention and maintenance, OLTP databases act clunky and prove difficult to scale.

So what’s a developer to do?

Is Databricks onto something by focusing on streaming data — and thereby making it better suited to AI and agent-based architectures, by treating the data like developers have treated code?

I discussed this topic last week withSanjeev Mohan, an independent analyst at the Databricks conference. We talked about Lakebase and the way its approach enables scaling using Postgres. The new service is in part based upon technology developed byNeon, a company Databricks has acquired.

Thomas Gauvin, a Cloudflare developer,detailed on his personal blog how Neon uses Postgres to implement “a custom storage system (written inRust) that intercepts calls to update pages in the block storage and stores these updated pages on a cloud object store instead of the computer’s disk. This decoupling enables independent scaling of compute and storage.”

The New Stack’sSusan Hall interviewedNikita Shamgunov, cofounder and CEO of Neon, in early 2024.He said that, though commonly used in git repositories, branching was never a good fit for databases.

Branching is available with Neon, though it has come about through a lot of hard work, Shamgunov said, and has evolved from an infrastructure feature to a developer workflow tool.

“It takes a next-generation architecture, storage architecture, to enable branching, because the key feature of branching is copy-on-write,” he said. “That’s what git has. For example, when you create a branch, you’re basically moving a few pointers around. And that gives you an isolated, full copy of your data in a separate branch.”

Databricks sees how, with Neon, it can provide a technically superior way to achieve real-time data streaming, enabling the capability to also transform data in real time. Databricks can leverage its data intelligence platform, based on theLakehouse architecture, to provide an end-to-end experience.

“Why are they getting into this business?” Mohan asked. “Owning the analytical data is not enough. The keys to the kingdom reside on the operational or the transactional side. The world’s most important data is in Salesforce, it’s inSAP, is in a bunch of other ERPs.”

Purpose-Built for AI

The traditional methods for integrating databases are complex and not suited to AI, Xin said. The challenge lies in integrating analytics and AI with transactional workloads.

Consider what developers would do when adding a feature to a code base, Xin said in his keynote address at the Data + AI Summit. They’d create a new branch of the codebase and make changes to the new branch. They’d use that branch to check bugs, perform testing and so on.

Xin said creating a new branch is an instant operation. What’s the equivalent for databases? You only clone your production databases. It might take days. How do you set up secure networking? How do you create ETL pipelines and log data from one to another?

Lakebase takes the concept of OLTP databases and turns it on its head.

“First and foremost, it’s based on open source Postgres,” Xin said. “And second, it built on a novel decoupled storage from compute architecture that actually enables the modern-day developer workflow.”

The Databricks explanation: Storage and compute use separate clusters. Systems may scale to concurrent users and larger data sizes. The storage formats are open. Parquet, for example, provides an API to tools and engines, including machine learning (ML) and Python/R libraries.

Elastic scaling enables thousands of workloads to go live at a low cost, utilizing low-cost Postgres instances.

“The separation of storage from compute architecture also has a copy-on-write capability built in, so that we can instantly branch off a database,” Xin said. “It takes less than a second to create a whole clone of the database, and that includes most of the data and the schema of the database.

“And because of the copy-on-writing capability, you don’t actually have to pay for extra storage unless you start making changes, and only the changes themselves will incur extra charge, because under the hood, they all share the same storage.”

Streaming Is Changing Enterprise Data Needs

Streaming is now a first-class citizen in the enterprise, Mohan told me. The separation of compute and storage makes a difference. We are approaching an era when applications will scale infinitely, both in terms of the number of instances and their scale-out capabilities. And that leads us to new questions about how we start to think about evaluation, observability and semantics.

Accuracy matters. Language is semantic by nature, meaning there is a need for more capabilities to evaluate the veracity of the AI’s output.

ADP may have the world’s best payroll data, Mohan said, but then that data has to be processed through ETL into an analytics solution like Databricks. Then comes the analytics and the data science work. The customer has to perform a significant amount of data engineering work and preparation.

Databricks and others, likeSnowflake, don’t want to be on the receiving end of the data. They want to serve customers who, for instance, need reports and require them to be delivered quickly. It’s hard to do that with the systems they now have in place.

Take, for instance, Securities and Exchange Commission reporting, Mohan said. The customer has a dashboard or some reports. These customers need to know the business lineage of that data.

“I wanna know, where did this data originate?” Mohan said, “How was it transformed? How was it cataloged? How was it integrated with other pieces of data before I ran my dashboard on it?

“So now, if Databricks owns the entire life cycle of data from creation all the way to consumption, then they own the data. It never leaves Databricks’ ecosystem.”

A “Disaggregation of Storage and Compute”

So, how does that work?

The New Stack’sFrederic Lardinoiswrote that Lakebase combines the familiarity and extensibility of Postgres, the scalability of a modern serverless architecture, a modern developer experience, with the unified data experience of Databricks’ Lakehouse, and the operational maturity of the company’s Data Intelligence Platform.

Mohan suggested that Databricks’ purchase of Neon offers Databricks an advantage. “What they’re saying is that using their Neon acquisition, they can now have whatever frontend application sitting on top,” he said. “In the future, that may be an agent; that’s the bet, but agents are new. So that agent is going to write the data into an open standard file format like Parquet with Iceberg or Delta or Hudi on top.”

Parquet, Iceberg, Delta Lake and Hudi are all Apache projects.

“And then you’ll have a compute engine,” Mohan said. “So it’s a complete disaggregation of storage and compute.”

Xin said the separation of storage and computing is critical in the age of agentic coding and AI. With AI agents, an enterprise will have thousands of AI agents, even millions.

“The AI agents are acting as their own individual engineers,” Xin said. “They’re doing experiments on your codebase, maybe adding new features. You might even have multiple AI agents adding new features, adding the same feature, and you have judges to determine which feature is the best to implement. Every AI agent can actually add their own code branch, but also their own databases, at virtually no cost for experimentation.”

The underlying storage layer also makes it super easy to synchronize data at very high throughput from one object store to another object store, Xin said — so from one data lake to another data lake, from Lakehouse to Lakebase.

In conclusion, I asked Mohan: “Where do you see us now?”

He replied, “One thing that’s starting to stick out a little bit is the need for evaluation.”

And this comes down to the semantics of AI — the language nuances — that will require a deeper evaluation.

“The biggest problem customers have is reliability,” Mohan said. “Can you trust these models? Are they going to be accurate? Semantics become really important.”

TRENDING STORIES
GroupCreated with Sketch.
Alex Williams is founder and publisher of The New Stack. He's a longtime technology journalist who did stints at TechCrunch, SiliconAngle and what is now known as ReadWrite. Alex has been a journalist since the late 1980s, starting at the...
Read more from Alex Williams
SHARE THIS STORY
TRENDING STORIES
Snowflake is a sponsor of The New Stack.
TNS owner Insight Partners is an investor in: Databricks.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTERReceive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to ourTerms of Use andPrivacy Policy.

[8]ページ先頭

©2009-2025 Movatter.jp