qlyoung's wiki

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
tiered_storage [2025/10/06 04:06] – created qlyoungtiered_storage [2025/10/22 15:33] (current) – wording qlyoung
Line 1: Line 1:
 ===== tiered storage ===== ===== tiered storage =====
  
-I think a lot of people - myself included - get fixated on the "single source of truth" storage paradigmFiles live in one big beautiful cluster that you can access from any device. I see the appeal. You can focus all your efforts making that cluster as big and beautiful as you want, as resilient as you wantBut after spending lots of time building out a big beautiful cluster, I kept having the experience that I wanted some file and I just couldn't get to itAnd when I could get to it the experience of browsing aroundsearchingtransferring files all over a network just...sucks. After you buy into the paradigm, after a while you get used to it and forget that it doesn't have to be like that. It's totally possible to have all that data on a superfast SSD drive right next to your CPU, easily searchable and accessible at all times.+There are two schools of thought regarding storage. One prioritizes resource efficiency and consistency; data should be stored in one location, and the location made accessible to all clientsThis maximizes total available storage space at the expense of availability and access speedThe other stores data where it is needed; this may be in multiple placesthus trading storage for availability and access speedand introducing the potential for inconsistency in multihomed data.
  
-finally realized that I fundamentally hate network storage as a primary storage locationIt requires a network and even when you have one it'just too slowShortly after that I looked at storage prices, and suddenly, it turns out that you can get huge amounts of storage for not a lot of moneyGranted, it depends on the size of your frequently accessed datawhich I term your "working set". For me that's photosmusic, documents, and projectsFor me, at today's storage unit prices, I can afford around 6-8tb of working setThat's a lot! My actual working set, the stuff I access on a regular basisfits in that with couple tb to spare.+For a long time was firmly in the first school. Files live in one big beautiful cluster that you can access from any device. see the appealYou can focus all your efforts making that cluster as big and beautiful as you want, as resilient as you want. There'only one copy of data, so you never have inconsistenciesBut after lots of time using a big beautiful cluster, I kept having the experience that I wanted some file and I just couldn'get to itAnd when I could get to it the experience of browsing aroundsearchingtransferring files all over a network just...sucks. After you buy into the paradigmafter while you get used to it and forget that it doesn't have to be like that.
  
-Yes, some things are just too big to store on deviceAt today's storage unit pricesstoring 30tb of movies that you access once a year on your laptop doesn't make sense yet for most peopleAnd for thatnetworked storage is fine - you rarely access those items.+In the past my laptop didn't have enough space to store all my dataSmall form factor drives just weren't big enoughand then they got big enough but they were so expensive that it only made sense to put them in a workstation (desktop)In 2025annoyed by the downsides of network storage, I started thinking about how to improve the experience. I realized I have two tiers of data.
  
-After assessing my current data distribution and doing some consolidation I established that 8tb could store all of my  +=== working set ===
-frequently accessed" files. This covers my music, photos, documents, and projects. It doesn't include stuff that falls in the bucket of "huge binary files" - movies, 300gb DNxHD Resolve projects, that sort of thing.+
  
-Since it was about time for a new laptop anyway, converted every computer I own to have at least 8tb of storageconsolidated my working set into a single directory, and synced it to all devices in one epic multi-day multi-terabyte sync session.+I have a set of files I frequently access; my musicphotos, documents, and projects. This is what I think of as my "working set". It doesn't include stuff that falls in the bucket of "huge binary files" movies, 300gb DNxHD Resolve projects, that sort of thing. Right now my working set is ~4tb.
  
-feel like hit storage nirvana. I get on an airplane, spend few hours editing spreadsheet or something, close my laptopBy the time I get home all of that work has opportunistically synced to all my other computers and I can sit down at my desktop and open that fileIt's amazing. I barely need to touch my big chungus pool.+=== other stuff === 
 + 
 +Some things are just too big to store on-device. At today's storage unit prices, storing 30tb of movies that you access once a year on your laptop doesn't make sense yet. Storing those things on a really big cluster accessed via the network is fine. 
 + 
 +=== thoughts === 
 + 
 +After thinking about these tiers realized: 
 + 
 +  - My working set is ~4tb 
 +  - m.2 NVMe drives can now accommodate that working set for a reasonable price 
 +  - which means can now have my working set completely stored on my laptop 
 + 
 +My desktop already had 10tb of storage and my home server 12tbDue for a laptop upgrade anyway, got 8tb of storage on my new machine and then set about collating my working set into an organized directory tree to prepare it for [[syncing]]. I then engaged in massive synchronization campaign that took about week to settle out. With careful preparation there were almost no sync conflicts and my working set is now replicated on all devices. 
 + 
 +At this point in time I exist in storage nirvana. I work on my desktop. Grab my laptop and catch an airplane; all the data I was working on is on my laptop and I pick up where I left off. No need to connect to airplane wifi. Get where I'm going, connect my laptop; it's syncing everything to my home server and desktop in the background. 
 + 
 +=== special case: phone === 
 + 
 +I only have 1tb of storage on my phone, so it can't store everything; and even if I wanted to, syncthing on iOS basically doesn't work because of the lack of background servicesThis is completely fine though, because 1) I don't really do much work on my phone and 2) my phone is always onlineIf I need access to something from my phone, I can download it from my home serverIf my phone is completely offline and I *really* need access to my data, it's on my laptop. And if my phone is offline and I don't have my laptop, well, I will do something else. 
 + 
 +==== Further thoughts ==== 
 + 
 +As it turns out, this is one of the few concepts in computing that I didn't invent /s. From wikipedia, the free encylopedia: 
 + 
 +<blockquote> 
 +HSM is a long-established concept, dating back to the beginnings of commercial data processing. 
 +</blockquote> 
 + 
 +[[wp>Hierarchical storage management]]
Panorama theme by desbest
tiered_storage.1759723608.txt.gz · Last modified: by qlyoung
CC Attribution-Noncommercial-Share Alike 4.0 International Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Noncommercial-Share Alike 4.0 International