need a prof. guy who schould scrap data for 2 Sites.
From the 2 sites we need only comments from the last 6 month. 2 sites:
(1) how many fields the user should fill to send a request One input for the country and town and one input for the hotel name input text: name of the hotel, name of the country and city
(2) unicity of the hotel name different hotels of the same city can have the same name... We dont have a solution at moment...
(3) fields of the comments The scrapper get for each hotel the list of comment on request. It show to the user max 30 comments (staring from the last) The fields of each comment are: title date text content assessment (number of stars) link of the original comment (4) the comments are scrapped only on demand and saved in a database -> That means in the case of already existing hotel, that only DELTA is scrapped (the newest comments, that tha exististing one)
(5) I will prefer Java for the scrapper. and eclipse dev. environment