Hmm, just a thought - you could use a small script to put their data into whatever form you want, so if you find a dataset that has what you need but in the wrong format, that might still work?
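To give a rough idea of what I mean by a small script, here's a minimal sketch in Python that turns CSV text into JSON. The column names and sample rows are made up purely for illustration, and I'm assuming the data you find is CSV and that JSON is the form you want; swap in whatever formats actually apply.

```python
import csv
import io
import json


def csv_to_records(csv_text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [dict(row) for row in reader]


def records_to_json(records):
    """Serialize the list of dicts as pretty-printed JSON."""
    return json.dumps(records, indent=2)


# Hypothetical sample data, standing in for whatever you download.
sample = "name,year\nAlpha,2001\nBeta,2005\n"

records = csv_to_records(sample)
print(records_to_json(records))
```

Note that the csv module reads every value as a string, so you'd convert numeric columns yourself if that matters for your use.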
Otherwise, if you're looking for databases, restricting your search to .edu sites (e.g. adding site:.edu to the query in Google) might be worth a try, to cut out a lot of the nonsense.