Machine learning is only possible when there’s enough of data available. Data collection is one of the most vital components of the machine learning lifecycle, but it’s also the one that’s often overlooked.
Web scraping—programmatically collecting data from web pages—is often the primary technique used to collect information from the Internet. While web scraping can be very straightforward in some instances, it can be very tricky in other instances. Especially when the data we’re interested in is hidden behind a button that requires a mouse click.
Continue reading “Building an Advanced Web Scraper for Kaggle Leaderboards Using RSelenium”