
(Marcio Jose Bastos Silva/Shutterstock)
Like many analysis establishments, Harvard College struggled to handle the quickly rising storage atmosphere backing its HPC clusters. The college had little visibility into which analysis tasks have been consuming storage, and the 5,000 customers throughout the College of Arts and Sciences (FAS) had little cause to care. However due to storage insights enabled by a Starfish knowledge administration software, the college developed a chargeback system that enabled it to invoice customers for the storage they used, which helped it get off the storage treadmill and put it on a course for sustainability.
Harvard College’s FAS Analysis Computing group manages HPC and basic compute sources for a big swath of the college, together with core sciences like astronomy, physics, and chemistry in addition to engineering–mainly, every thing on the faculty besides the medical and enterprise colleges. The group manages one main supercomputer, Cannon, with 1,800 compute nodes and greater than 1,000 GPUs, in addition to just a few smaller programs.
Earlier than it began getting a deal with on its storage atmosphere, FASRC managed a mish-mash of varied file programs and storage servers. In keeping with Raminder Singh, the FASRC senior director who heads the group, it had 30 petabytes of Luster programs and one other 46PB of different file system sorts, together with NFS and Isilon programs from Dell/EMC and others. It utilized a lot of white field storage servers, a lot of which have been upwards of 10 years previous and now not underneath guarantee.
FASRC struggled so as to add sufficient capability to satisfy storage calls for. In keeping with Singh, who joined FASRC in 2018 and was promoted to the senior director position in June 2023, storage capability was rising at a 20% to 30% annual fee. From 2020 to 2025, the quantity of storage underneath administration doubled, he stated.

Storage at Harvard FASRC was rising at 20% to 30% per 12 months earlier than the chargeback system (Joe Techapanupreeda/Shutterstock)
About 4 years in the past, Singh’s predecessor at FASRC acknowledged that the scenario was unsustainable. An enormous a part of the managed storage development was that researchers had no incentive to reduce their storage use. The price of storage was not included as a element of researchers’ grant purposes, which led customers to grossly over- or underestimate their storage wants. One thing needed to change.
The concept that the FASRC director got here up with was to implement a chargeback system. By tying researchers’ precise storage consumption again to their grants, researchers could be accountable for paying for the storage that they used. Having some pores and skin within the recreation would hopefully incentivize researchers to develop into extra accountable customers of storage sources, and storage consumption would decline.
Different analysis establishments have carried out chargeback programs to tie the sources consumed as a part of a venture again to the grants obtained from the Nationwide Institutes of Well being (NIH) and Nationwide Science Basis (NSF). Nonetheless, the federal authorities has strict accounting guidelines for chargeback programs. Researchers should be capable of justify the prices, and the work have to be auditable.
“There was the concept to truly make storage sustainable,” Singh stated. “How we will provide storage as a service, the place we will get well the associated fee? And to do this, we needed to give you sure tiers of storage.”
FASRC developed a multi-tiered storage system with completely different efficiency, value, and recoverability attributes. Tier 0 is meant for energetic evaluation analysis knowledge linked to the supercomputer, and prices $50 per 12 months per TB, however no snapshots or catastrophe restoration. Tier 1 is meant for basic objective knowledge and comes with each day snapshots, however is comparatively value at $250 per 12 months per TB. Tier 2 capabilities as an middleman storage repository for older analysis knowledge, with no weekly snapshots at $100 per 12 months per TB. The most affordable type of storage is tape, at $5 per 12 months per TB.
Along with the multi-tiered storage, FASRC realized it wanted a greater manner to determine who’s utilizing the storage. How might it implement a chargeback system to trace precise storage utilization in such a busy analysis establishment, with 1000’s of researchers throughout a whole lot of labs creating billions of recordsdata throughout half-a-dozen file programs? FASRC technicians could possibly get insights for a single well-managed storage system utilizing native instruments, however getting the overarching view throughout all the atmosphere could be fairly troublesome.
The answer got here within the type of a a lot beloved sea animal: the Starfish. Particularly, it was the unstructured knowledge administration software from Starfish Storage that finally gave FASRC the perception it wanted to get a deal with on its quickly increasing storage atmosphere and to implement the chargeback system.
Starfish provides FASRC an in depth view into what’s being saved. Its patented index system permits it to construct a world stock of all of its storage property, and tie that storage consumption again to particular customers and analysis venture.
“One massive cause for choosing Starfish was it could scan completely different file programs,” Singh says. “You need unified data to current to your customers. They can’t log into three or 4 completely different programs to get that data.”
Harvard’s researchers usually aren’t laptop consultants who intrinsically know easy methods to create an environment friendly and high-performance storage atmosphere. They’re undergrad, graduate, and postgrad college students who’re enmeshed in pursuing scientific information.
“In case you go ask the researchers what they want, they don’t know. That’s what working in analysis computing from final 10 years taught me,” Singh stated. “Individuals will ask for 100 terabytes of storage as a startup package deal, and 90% of the time they solely use 10% of it.”
Due to Starfish, the researchers have a comparatively easy mechanism to realize some semblance of storage effectivity–through the use of the Starfish tagging system to mark which recordsdata might be archived.
“All they must do is put a tag down after which the Starfish mechanism within the background does the entire operational housekeeping work to verify these recordsdata into the tapes, into the archival system,” stated Starfish Founder and Chief Evangelist Jacob Farmer. “And it gives the restoration mechanism to deliver them again out once more, and it gives the historic itemizing of what was there initially.”
Due to Starfish, FASRC was capable of uncover that almost 50% of the info it was storing had not been accessed up to now two years. Armed with this data, the heads of analysis applications have been incentivized to both delete unneeded knowledge or to maneuver knowledge that also had worth to the archive or its IBM tape library.
“After we began charging for sure storage, folks instantly went forward and deleted a bunch of issues by themselves, and that in itself truly helped with the expansion facet of issues,” Singh stated. “Plenty of this [data] was from individuals who truly left Harvard perhaps 5 years in the past, and their knowledge was nonetheless there. That was their first response as a result of now they’re paying, and so they don’t wish to pay for the info which doesn’t even have worth.”
The chargeback system not solely generated $750,000 in direct income the primary 12 months and $1.5 million the next 12 months, however the surge in knowledge deletion and archiving to tape helped the group get a deal with on its out-of-control storage development. Singh now could be seeking to retire older, out-of-warranty storage programs storing a number of tens-of-PBs. On the similar time, he’s seeking to shift to enterprise storage system and away from open supply programs. The income and financial savings generated by the chargeback system has given Singh credibility with the college’s monetary people to proceed with the storage modernization.
“We’re making an attempt to maintain the associated fee underneath management, we’re making an attempt to maintain that stability, in order that any new storage we’re shopping for, we’re capable of get well prices,” Singh stated.
However the success of the chargeback system had yet another massive affect: It gave FASRC the monetary headroom to rent a storage knowledgeable to personally work with the researchers to give you efficient knowledge storage methods, which is able to drive much more storage financial savings sooner or later. On this respect, the implementation has been a win-win-win.
“The primary two years was spent to have a system, have the instruments, have every thing out there for this [chargeback system] to truly develop into sensible. You begin to see outcomes after a few years,” Singh stated. “We’re beginning the subsequent section of our storage venture, the place we can be consolidating a few of the choices based mostly on the learnings we now have from working the storage service middle for the final two to 3 years.”
It was Singh’s predecessor who introduced in Starfish almost 4 years in the past to pave the way in which for the chargeback system and the brand new storage tiers. However Singh has accomplished a lot of the work of implementing the chargeback system, together with the event of a brand new system based mostly on the open supply ColdFront software program out of the College of Buffalo to automate a lot of the chargeback duties.
In keeping with Farmer, there’s an previous saying in analysis computing to the impact that “there’s cash for storage, however there’s by no means any cash for storage administration.” Maybe it’s time to revisit that phrase in gentle of FASRC’s expertise with Starfish?
“Storage administration contains all types of issues, like curating your knowledge and sharing your knowledge and charging it again,” Farmer stated. “We’ve type of at all times thrown the concept on the market that when you herald Starfish that will help you with the operational facet of saving cash, like deleting and archiving and housekeeping, that that may get you the ROI that pays for every thing else that you may want for knowledge administration.”
Associated Gadgets:
Starfish Helps Tame the Wild West of Huge Unstructured Knowledge
Peering Into the Unstructured Knowledge Abyss
Unstructured Knowledge Progress Carrying Holes in IT Budgets
chargeback, ColdFront, FAS Analysis Computing, file system, Harvard College, hpc, Jacob Farmer, Raminder Singh, analysis, StarFish, unstructured knowledge