Yeah, but can't they just use a file system that's smart enough to write exactly duplicated data only once and store a pointer record for each copy, so that it still appears as 500 copies of the same file? I've read a couple of articles about file systems doing this now.
As long as that is handled at a low enough level, I imagine they could get away with it.
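
To make the idea concrete, here is a rough Python sketch of what I mean, not how any particular file system actually does it. The `DedupStore` class and all of its method names are made up for illustration, and it deduplicates whole files by SHA-256 hash, whereas a real file system would more likely work on blocks below the file layer.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store (hypothetical, for illustration only).

    Identical data is stored once; each file name is just a pointer
    (the content hash) to the shared copy.
    """

    def __init__(self):
        self.blocks = {}    # content hash -> the single stored copy of the data
        self.refcount = {}  # content hash -> number of file names pointing at it
        self.files = {}     # file name -> content hash (the "pointer record")

    def write(self, name, data):
        digest = hashlib.sha256(data).hexdigest()
        old = self.files.get(name)
        if old == digest:
            return  # same name, same content: nothing to do
        if old is not None:
            self._release(old)  # overwriting: drop the old pointer first
        if digest not in self.blocks:
            self.blocks[digest] = data  # first time this content is seen
        self.refcount[digest] = self.refcount.get(digest, 0) + 1
        self.files[name] = digest

    def read(self, name):
        # Follow the pointer; the caller cannot tell the data is shared.
        return self.blocks[self.files[name]]

    def delete(self, name):
        self._release(self.files.pop(name))

    def _release(self, digest):
        # Free the stored data only when the last pointer goes away.
        self.refcount[digest] -= 1
        if self.refcount[digest] == 0:
            del self.refcount[digest]
            del self.blocks[digest]

    def physical_bytes(self):
        # Space actually used, as opposed to the apparent size of all files.
        return sum(len(data) for data in self.blocks.values())
```

Writing the same bytes under 500 different names leaves one entry in `blocks` and 500 entries in `files`, so every name reads back as a full copy while the data is only stored once. Changing one of them just repoints that name at new content and leaves the other 499 alone.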