
1 minute read
SCRAPING HOTEL DETAILS IN PYTHON
ONCE WE HAVE THE URL FOR THE HOTEL, WE CAN START SCRAPING THE DATA. WE CAN USE THE REQUESTS PACKAGE TO MAKE A GET REQUEST TO THE HOTEL’S TRIPADVISOR PAGE. IT WILL GIVE US THE HTML OF THE PAGE, WHICH WE CAN THEN PARSE USING BEAUTIFUL SOUP. WE CAN USE BEAUTIFULSOUP TO FIND ALL OF THE ELEMENTS ON THE PAGE THAT CONTAIN DATA ABOUT THE HOTEL. IN THIS CASE, WE WANT TO SEE THE ELEMENTS THAT COMPRISE THE HOTEL’S NAME, RATING, NUMBER OF REVIEWS, AND PRICE. WE CAN THEN EXTRACT THE DATA FROM THESE ELEMENTS AND STORE IT IN A LIST. WE CAN THEN WRITE THE HOTEL’S NAME AND COST INTO A TEXT FILE FOR LATER USE AND KEEP IT IN A PANDAS DATA FRAME.
THIS ARTICLE AIMS TO WRITE A PIECE OF CODE IN WEB SCRAPING TECHNIQUE, EXTRACTING ALL THE INFORMATION ON FAMOUS HOTELS AND THEIR HOTELS LOCATED AROUND THE WORLD AND COMPARING THEM WITH EACH OTHER CONSIDERING THEIR RATINGS, LOCATION, PRICES, AND REVIEWS.
Advertisement
