MIT OpenCourseWare Scraper
- 1 minMIT OpenCourseWare (MIT OCW) is an initiative of the Massachusetts Institute of Technology (MIT) to put all of the educational materials from its undergraduate – and graduate-level courses online, freely and openly available to anyone, anywhere. MIT OpenCourseWare is a large-scale, web-based publication of MIT course materials.
Recently while going through the website I had the idea of getting all the course materials and its details offline. So I checked how all the course details are being populated in the Search by Topic
page, as it would be easy to scrap the search page rather than other ones.
I am unable to write a brief description on how was this total script is running, but I made the script as simply as possible and you will be able to understand all the steps easily.
You can start the script by running python script.py
You can checkout the repo here. Fork it or Star if you like it.
You can mail me at sunil@suniltatipelly.in for any queries or doubts regarding this.
Below is the output of the script :
You can checkout the repo here. Fork it or Star if you like it.