I have to agree with you, Julian, in the sense that if you simply overlaid two samples, even with identical velocity and with the phases aligned as closely as possible at the beginning of the sample, the two would be out of phase by the end. However, mdovey's approach would need little modification to do a pretty good job. Given Christian's test of zipping one of the files, I doubt anything this cumbersome is needed, but here's the idea. Most of the energy of the string is at some fundamental frequency. Since a good number of the keys on the piano have three strings, technically you'd have to determine three close but not identical frequencies for these, along with their phase alignment. For a given key at a given velocity, the amplitude is probably easily described as a function of time. Applying this envelope to the phase-aligned fundamental-frequency mashup thingy and subtracting it from the sample would probably remove most of the energy of the sample. All of the data that was removed could be represented by a handful of parameters that would take only a few bits and could be stored as metadata. The residual sample data could then be expressed with far fewer bits. As big a pain in the butt as this would be, all the heavy lifting is on the encoding side. The decoding is much simpler. A lot more refinement would be needed to achieve a 10:1 compression ratio. Neat idea, but why waste the time developing some crazy algorithm from scratch when you can just tailor something like zip and get the same result? I think Football is right, this thread is nuts. We probably need a new thread just to debate the compression algorithm.
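For anyone who wants to poke at the idea, here is a rough Python sketch of the "fit a decaying sinusoid, subtract it, keep the residual" step. It is only an illustration under heavy assumptions: the sample is synthetic, there is a single string rather than three, and the frequency and decay rate are assumed to be known already. All function names and numbers are made up for the example, and it says nothing about how VSL actually works.

```python
import math
import random

def make_tone(n, rate, freq, decay, amp, phase):
    """Synthetic 'string': amp * exp(-decay*t) * sin(2*pi*freq*t + phase)."""
    return [amp * math.exp(-decay * i / rate)
            * math.sin(2 * math.pi * freq * i / rate + phase)
            for i in range(n)]

def basis(i, rate, freq, decay):
    """Decaying cosine and sine at sample index i (assumed-known freq/decay)."""
    t = i / rate
    env = math.exp(-decay * t)
    w = 2 * math.pi * freq * t
    return env * math.cos(w), env * math.sin(w)

def fit_partial(sample, rate, freq, decay):
    """Least-squares fit of a*cos_part + b*sin_part to the sample.

    Solves the 2x2 normal equations directly; (a, b) together encode
    the amplitude and starting phase of the partial.
    """
    scc = sss = scs = sxc = sxs = 0.0
    for i, x in enumerate(sample):
        c, s = basis(i, rate, freq, decay)
        scc += c * c
        sss += s * s
        scs += c * s
        sxc += x * c
        sxs += x * s
    det = scc * sss - scs * scs
    a = (sxc * sss - scs * sxs) / det
    b = (scc * sxs - scs * sxc) / det
    return a, b

def residual(sample, rate, freq, decay, a, b):
    """Subtract the fitted partial; this is what would still need storing."""
    out = []
    for i, x in enumerate(sample):
        c, s = basis(i, rate, freq, decay)
        out.append(x - (a * c + b * s))
    return out

def energy(signal):
    """Sum of squared sample values."""
    return sum(x * x for x in signal)

def demo():
    """Return (residual energy / original energy) for a noisy synthetic tone."""
    random.seed(0)
    rate, n = 8000, 8000                # one second at a low sample rate
    freq, decay = 220.0, 3.0            # hypothetical values for the example
    tone = make_tone(n, rate, freq, decay, amp=0.8, phase=0.7)
    # Low-level noise stands in for everything the model cannot describe.
    sample = [x + random.gauss(0.0, 0.01) for x in tone]
    a, b = fit_partial(sample, rate, freq, decay)
    res = residual(sample, rate, freq, decay, a, b)
    return energy(res) / energy(sample)

if __name__ == "__main__":
    print("fraction of energy left in residual:", demo())
```

The decoder only needs the stored parameters (frequency, decay, a, b) plus the residual: it regenerates the partial and adds the residual back, which is why the decoding side is cheap. A real piano note would need several partials per string and a better envelope than a single exponential, so treat the tiny residual here as a best case.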
I may be misinterpreting this, but weren't Christian's results a saving of just 8% with the zip and 29% with the rar? That is lossless, in that the complete file is re-created on decoding (un-zipping). So the question is: is the VSL system actually a zip-like function where the exact file is reconstructed on decoding (unlikely, I would guess, at 10:1), or is it in fact lossy data compression where 90% of the original data is removed forever? And if that is the case, how much does it change the original sound quality? I would have thought most good musicians/engineers would detect 10:1 compression ratios in an A/B comparison, however good the algorithms used, particularly when there is low-level ambience involved (the room mic recordings).
Julian
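To put the percentages and the 10:1 figure on the same scale, here is a small Python sketch of the conversion between "percent saved" and "compression ratio". The function names are made up for the example; the 8%, 29% and 10:1 figures are the ones quoted in the post above.

```python
def saving_to_ratio(saving):
    """Convert a fractional size saving (0.08 = 8%) to an N:1 ratio."""
    return 1.0 / (1.0 - saving)

def ratio_to_saving(ratio):
    """Convert an N:1 compression ratio to the fraction of size saved."""
    return 1.0 - 1.0 / ratio

if __name__ == "__main__":
    print(round(saving_to_ratio(0.08), 2))   # zip: 1.09, i.e. about 1.09:1
    print(round(saving_to_ratio(0.29), 2))   # rar: 1.41, i.e. about 1.41:1
    print(round(ratio_to_saving(10.0), 2))   # 10:1 means 0.9, i.e. 90% saved
```

So the general-purpose lossless results sit at roughly 1.1:1 to 1.4:1, a long way short of 10:1.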