Data Archive: Difference between revisions

From Sea of Fate
Created page with "==Introduction== We are building an offline vault of data for use over the next few years. To accomplish that We need to have one or more '''Virtual Machines''' that will be able to download and store the data. As most of the load will be on CPU storage and Internet bandwidth for the archival we will keep this part separate from the AI and Jellyfin host Quince that is powered by a ''' GPU Passthrough'''. ==Reasons for a Data..."
 
Line 1: Line 1:
==Introduction==
We are building an offline vault of data for use over the next few years. To accomplish that, we need one or more '''[[Virtual Machines]]''' that can download and store the data. Because most of the archival load falls on CPU, storage and Internet bandwidth, we will keep this part separate from Quince, the AI and Jellyfin host, which relies on '''[[Linux Docker And GPU Passthrough | GPU Passthrough]]'''.
We are building an offline vault of data for use over the next few years. To accomplish that, we need one or more '''[[Virtual Machines]]''' that can download and store the data. Because most of the archival load falls on CPU, storage and Internet bandwidth, we will keep this part separate from Quince, the AI and Jellyfin host, which relies on '''[[Docker Hosts | GPU Passthrough]]'''.
 


==Reasons for a Data Archive==


We want a completely offline copy of as much of the data currently on the WWW as possible. We are doing this now rather than earlier because the new wave of LLMs and AI is creating its own summarised and sanitised versions of the wealth of data that has accumulated over the last 30 or so years.

Revision as of 15:48, 3 February 2026

Introduction

We are building an offline vault of data for use over the next few years. To accomplish that, we need one or more Virtual Machines that can download and store the data. Because most of the archival load falls on CPU, storage and Internet bandwidth, we will keep this part separate from Quince, the AI and Jellyfin host, which relies on GPU Passthrough.

Reasons for a Data Archive

We want a completely offline copy of as much of the data currently on the WWW as possible. We are doing this now rather than earlier because the new wave of LLMs and AI is creating its own summarised and sanitised versions of the wealth of data that has accumulated over the last 30 or so years.