The Voice storage used is the part of the hard drive used by Voice and the Text storage used the part used by Text

(Couldn't resist...)
Voice include all audio "data", Greeting messages, Voice Mail messages, Voice Menus, voice forms, etc.
Text or Data is for all non audio informations, users'voice mails information, corporate directory, system profile, operation measurement traffic, etc.
More clear ?
