Speed Up Your Data Collection With Listly
As data scientists, we always seek data, after all, you can’t spell data science without “data”. There are many ways to get data, if you work in a company, then probably your company will provide you with the data you need to work on. If you’re an independent data scientist or working freelance, then your clients will give you the data. But, if you’re new to data science or working on developing your skills, then you will need to collect data to use for analysis yourself.
Often, when we need to collect data for a project, the first place we consider is publically available data on the internet. As data scientist we should always be careful with the data we use, we need to make sure that we are only using publically available data and be considerate of data privacy laws and rules.
Once we are sure what data we can use, then we often start looking for the most optimal approach to collect that data and export it in a format that’s easy to use in our analysis process. During my career, using data science for school, work or writing, I got to try several data collection tools or web scrappers.
Today, I will propose to you a tool that I have found quite useful, flexible, and easy to install and start using, Listly. Listly is a free web scraping tool that you can add to chrome and export the data you need to start building projects. In this article, I will walk you through how to install and start using Listly.
First Step: Installing Listly
Listly is very easy to install and start using. All you have to do is head to Listly.io, create a free account, and then add Listly to your chrome.
That’s literally it. Once you do that, you are all set and can start using Listly.
To demonstrate how to use Listly, let’s assume you’re trying to scrap some data about the different shampoos you can buy from Amazon.com.
Second Step: Using Listly
After looking up “Shampoo” on Amazon, I can use the Listly chrome extension to either extract data from the whole page or from a specific part of it.