This presentation addresses the challenges of managing large datasets made up of lots of small files (LOSF) in AI-powered semiconductor and EDA (Electronic Design Automation) workflows, along with solutions to them. Because these workflows rely on vast datasets for both training and operation, they significantly increase data volume and provisioning needs.