As a scientist, I have so far mostly generated my own data sets. This takes a lot of time and effort. Since I started my training, far more data has become available in the world. In fact, total data capacity has been growing exponentially for quite some time. Perhaps I can track down that data set later. My point is that lots of people have learned lots of things about data handling and storage since I last checked in, and I can learn from them.
As far as I can tell, there are three ways to get access to a data set:
- Generate your own data.
- Download public data sets, or analyze other people’s data made available through APIs (Application Programming Interfaces).
- Scrape the web for data.
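To make the three routes concrete for myself, here is a minimal sketch in Python using only the standard library. All function and class names (`generate_data`, `fetch_text`, `parse_api_records`, `scrape_links`, `LinkCollector`) are hypothetical names I made up for illustration, and the "scraping" here only collects link targets from a page, which is just one simple thing a scraper might do.

```python
import json
import random
from html.parser import HTMLParser
from urllib.request import urlopen


def generate_data(n, seed=0):
    """Way 1: generate your own data (here, simulated noisy measurements)."""
    rng = random.Random(seed)  # seeded so the "experiment" is repeatable
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def fetch_text(url):
    """Download the body of a URL as text (shared by ways 2 and 3)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8")


def parse_api_records(payload):
    """Way 2: decode the JSON text returned by an API into Python objects."""
    return json.loads(payload)


class LinkCollector(HTMLParser):
    """Way 3 helper: collect the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def scrape_links(html):
    """Way 3: scrape a page, here by pulling out its link targets."""
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


if __name__ == "__main__":
    # Inline strings stand in for what fetch_text(url) would return,
    # so this runs without any network access.
    print(len(generate_data(5)))
    print(parse_api_records('[{"id": 1, "value": 2.5}]'))
    print(scrape_links('<p><a href="/data.csv">data</a></p>'))
```

In practice, `fetch_text` would be pointed at a real API endpoint or web page, and many people reach for third-party libraries such as `requests` and Beautiful Soup instead of the standard library. It is also worth checking a site's terms of use before scraping it.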
Now off to find and access some public data sets!