Provide information about tools for creating artificial data sets whose statistical properties are similar to those of the original data. Provide information on open data sets.