
(Gorodenkoff/Shutterstock)
Fixing huge information issues typically requires creating new computing approaches and new applied sciences. However typically, the newer applied sciences and strategies create extra issues that didn’t beforehand exist. One upcoming huge information analytics vendor that’s discovered a contented medium balancing new tech and confirmed strategies is Ocient.
Ocient was based in 2016 by a gaggle of technologists led by Chris Gladwin, who was the founding father of object storage vendor Cleversafe, which IBM purchased in 2015 for $1.4 billion. Again then, huge information lakes constructed on distributed file methods like HDFS and object storage methods like S3 had been thought of innovative. Equally, many corporations had been advised that one of the best ways to scale huge information workloads was to separate the compute and storage layers, which allowed them to scale independently.
Information was so huge, we had been advised, that one needed to centralize the info, ideally within the cloud, and produce compute to the storage. The storage media underlying HDFS or S3–and which most cloud information warehouses, like Snowflake and Redshift, are designed to make use of–was invariably spinning disk, which even at the moment is the most cost effective type of on-line storage.
However Gladwin and his crew had a distinct tackle the scenario. They noticed the spinning disk that AWS was investing so closely in as an obstacle to progress. One may run huge SQL analytics jobs by sending information throughout NICs to the storage layer, nevertheless it wouldn’t essentially be the quickest nor the most cost effective method.
As an alternative, Ocient developed its personal analytics database with a brand new structure that’s designed round NMVe drives. And as a substitute of separating compute and storage, Ocient’s design introduced the 2 again collectively. These two architectural design factors allowed Ocient to ship huge efficiency positive factors on among the hardest huge information challenges, in keeping with George Kondiles, Ocient’s co-founder and chief architect.

NVMe drives maintain substantial efficiency benefits over spinning disks (ALPAL-images/Shutterstock)
“When information get very giant, within the petabytes and above, with tens to tons of of trillions of data, what we see, what we imagine to be true, is that the abstraction layer that exists between the storage and the compute is a considerable obstacle to realizing big efficiency positive factors on the queries, comparatively talking, on that information,” Kondiles stated. “We work very carefully with eliminating all these abstraction layers in order that we’re in a position to simply mainly speak on to the info, learn instantly from the info in situ, do as a lot of the evaluation as we are able to with these actually wider pipes proper off the packing containers on these actually giant information units.”
A typical NVMe drive can learn information at speeds as much as 3,000 MB per second and 200,000 IOPS with direct connections to the PCIe bus. A ten,000 RPM spinning disk, alternatively, can learn information at accelerates 250 MB per second and ship perhaps 160 IOPS. When compute and storage are disconnected, as is the style, there’s extra community latency.
Ocient’s concentrate on using NVMe drives gave it a giant efficiency enhance over information lakes, which invariably use spinning disk. Whereas NVMe drives are costlier than spinning disk, they will entry information 30x sooner or extra, which give them an enormous efficiency benefit. For sure forms of always-on huge information workloads, the speedup that Ocient’s method is effectively price any further prices which will consequence from having to purchase a lot of NVMe drives and operating them in an on-prem trend.
Again in 2016, few analytics database distributors had been creating databases with NVMe in thoughts. Ocient sensed a possibility. “We had been all in on this NVMe drive idea very early on based mostly on simply us noticing that the present database software program that was on the market wasn’t essentially capitalizing on the type of comparatively novel capabilities that the drives have,” Kondiles advised BigDATAwire in a current interview. “And that was why we leaned in on it.”
That’s to not say that information lakes operating on object storage don’t have their place. Firms that may’t predict what their analytical wants are going to be will profit from the extra elasticity that the separation of compute and storage deliver. However for sure forms of always-on OLAP workloads–the kind that contain tens of petabytes of information and tons of of trillions of data–the overhead incurred by accessing HDDs over the community in an information lake setting is simply an excessive amount of.
“The types of issues that we’re concentrating on and making an attempt to unravel, we see some actually substantial efficiency enhancements, price enhancements…precisely for these types of information that the info lake method doesn’t essentially at all times have the very best outcomes,” Kondiles stated. “In some situations, there’s numerous worth available by maintaining them separate. And in others, there’s numerous worth to not essentially attempt to pressure a sq. peg in a spherical gap.”
Ocient caters to corporations with among the greatest huge information necessities, corresponding to telcos, advert tech corporations, governmental companies, monetary providers, and enterprises with large-scale observability workloads. A lot of Ocient’s prospects run their Ocient clusters on-prem, though there’s nothing to forestall the Ocient software program from being run within the cloud, which some prospects do.
Co-locating compute and storage reduces prices, however brings ancillary advantages too, Kondiles stated. “We had been focusing totally on efficiency and price effectiveness,” he stated. “However it’s additionally house discount and power discount, since you’re taking what was a bunch of storage nodes and a bunch of compute nodes and also you mix them collectively right into a single set of nodes, and the result’s decrease information middle footprint and decrease energy utilization.”
Ocient’s analytics database is constructed on the relational mannequin and makes use of commonplace ANSI SQL to entry information. On prime of that, it provides time-series and geospatial elements, which invariably are vital within the type of huge IoT- and senor-generated information units that Ocient prospects wish to crunch. It additionally consists of some machine studying primitives that enable prospects to run predictive analytic capabilities.
However Ocient’s database isn’t your backyard selection SQL retailer. As an illustration, the corporate has constructed erasure coding instantly into its question engine, which permits it to reduce the quantity of duplicate information that prospects retailer whereas retaining the aptitude to do a full restoration within the occasion of drive losses. That’s an instance of Ocient borrowing concepts from object retailer distributors.
Right here’s one other space the place Ocient zigs whereas the remainder of the business zags: secondary indexes.
“It’s one thing that numerous the larger names type of moved away from simply due to the perceived complexity of managing a schema and the forms of queries and no matter else,” Kondiles stated. “And what we discovered is that, particularly at these scales, the secondary indexes will be essential for reaching cheap both execution instances or prices for the system just because the info quantity is so excessive.”
Ocient has taken a realistic method to the way it develops its software program. It helps newer applied sciences, corresponding to NVMe and erasure encoding, whereas concurrently adopting older architectures, like secondary indexes and co-located compute and storage, when it is sensible to take action.
The method appears to be reasoning. The corporate stated final week that bookings over the primary 5 months of 2025 had been almost triple the speed of final yr. In April, the Chicago-based firm introduced the closure of its $42.1 million Sequence B spherical, bringing the corporate’s complete enterprise funding to $132 million.
The corporate is now in growth mode and looking for to develop revenues. As a part of that drive, final week Ocient introduced in John Morris to be its new CEO, changing Gladwin within the nook suite. Morris and Gladwin labored collectively beforehand at Cleversafe, the place Morris was introduced in as CEO previous to the IBM acquisition.
“I couldn’t be extra thrilled to welcome John as he takes the helm as CEO,” stated Gladwin, who’s now Ocient’s govt chairman. “His operational and strategic management come at a pivotal time for Ocient, very like when he joined Cleversafe and helped drive tripled revenues, which resulted within the firm’s $1.4B acquisition and 10x returns for traders.
Will historical past repeat itself? Solely time will inform.
Associated Gadgets:
Hyperscale Analytics Rising Quicker Than Anticipated, Ocient Says
Ocient Report Chronicles the Rise of Hyperscale Information
The Community is the New Storage Bottleneck