Scraping a website

Hello.

I want information on approximately 600 apartments scraped from: [url removed, login to view]

I need both the images and the text scraped for each individual apartment. The images I want are the ones at the top of each individual apartment's listing (Illustration 1).

The text fields to be scraped appear when you press the button “Se lejevilkår”, which expands additional information (Illustration 1).

I have listed the full set of fields below.

[Address]

[Description]

[Floor]

[HouseNo]

[ZipCode]

[City]

[Squaremeters]

[RentalPrice]

[RentalType]

[RoomSize]

[RentalAdvanceAmount]

[RentalDeposit]

[ACV]

[ACH2O]

Examples, each followed in parentheses by the corresponding label on the website, where applicable:

[Address]: Nytorv 5,5 th. 1450 København K

[Description]: Vil du bo i hjertet af København?

[Floor]: 5,5 th. (From the address)

[HouseNo]: 5 (From the address)

[ZipCode]: 1450 (From the address)

[City]: København K (From the address)

[Squaremeters]: 177 kvm (Areal)

[RentalPrice]: 30.000 kr (Leje pr. måned)

[RoomSize]: 4 (Antal værelser)

[RentalAdvanceAmount]: 1 måneds leje (Forudbetalt leje)

[RentalDeposit]: 90.000,- (Depositum I kr.)

[ACV]: 1.000,- (A conto varme)

[ACH2O]: 400,- (A conto vand)
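
To illustrate how the address-derived fields and the amounts could be extracted, here is a minimal Python sketch. It assumes every address has the same shape as the example above (street, house number, comma, floor, four-digit zip code, city) and that amounts use "." as the thousands separator. The names split_address and parse_amount are hypothetical. Note that the sketch treats the part after the comma ("5 th.") as the floor, whereas the example above lists "5,5 th."; adjust if the longer form is wanted.

```python
import re

# Assumed address shape, based only on the single example in this post:
# "<street> <house no>,<floor> <4-digit zip code> <city>"
ADDRESS_RE = re.compile(
    r"^(?P<street>.+?)\s+(?P<house_no>\d+\w?),\s*(?P<floor>.+?)\s+"
    r"(?P<zip_code>\d{4})\s+(?P<city>.+)$"
)


def split_address(address: str) -> dict:
    """Split an address string into street, house_no, floor, zip_code and city."""
    match = ADDRESS_RE.match(address.strip())
    if match is None:
        # Addresses without a floor part (for example) will not match this pattern.
        raise ValueError(f"Unexpected address format: {address!r}")
    return match.groupdict()


def parse_amount(text: str) -> int:
    """Turn an amount such as '30.000 kr' or '90.000,-' into a whole number."""
    whole_part = text.split(",")[0]          # drop ',-' or any decimals
    digits = re.sub(r"\D", "", whole_part)   # drop '.', spaces and 'kr'
    if not digits:
        raise ValueError(f"No amount found in: {text!r}")
    return int(digits)


if __name__ == "__main__":
    print(split_address("Nytorv 5,5 th. 1450 København K"))
    print(parse_amount("30.000 kr"))
```

Fields such as [RentalAdvanceAmount] ("1 måneds leje") are free text rather than amounts, so they are probably best stored as they appear.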

Images: All the carousel images shown in each apartment's thumbnail view on page 1. They should be saved with the address as the file name, followed by a number.

Example:

Nytorv 5,5 th. 1450 København K 1

Nytorv 5,5 th. 1450 København K 2

Nytorv 5,5 th. 1450 København K 3
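
The naming scheme above could be produced with something like the following Python sketch. The function name image_filename is hypothetical, and the ".jpg" extension is an assumption, since the example names show no extension. Characters that Windows does not allow in file names are replaced with an underscore.

```python
import re


def image_filename(address: str, index: int, extension: str = ".jpg") -> str:
    """Build '<address> <index><extension>' for one carousel image."""
    # Replace characters that are not allowed in Windows file names.
    safe_address = re.sub(r'[<>:"/\\|?*]', "_", address).strip()
    return f"{safe_address} {index}{extension}"


if __name__ == "__main__":
    for number in range(1, 4):
        print(image_filename("Nytorv 5,5 th. 1450 København K", number))
```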

Questions for potential challenge-solvers:

- How will you scrape it? Do you use any software that you hold a license for? If so, how much does the license cost, and is the executable file transferable?

- If no third-party software is used: can the scrape run every 24 hours and pick up any changes made to the website? For instance, if an apartment has been removed, I want to be notified about it, and pictures should be scraped only when new apartments are added. This lowers the bandwidth load on the other server by NOT scraping everything every 24 hours.
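
To make the 24-hour question concrete, here is a minimal Python sketch of one possible approach: keep the list of addresses from the previous run in a file, compare it with the current run, and download pictures only for the additions. It assumes the address uniquely identifies an apartment. The names diff_listings, load_seen and save_seen and the state file name are hypothetical, and the actual scraping and scheduling (for example a daily cron job) are left out.

```python
import json
from pathlib import Path
from typing import Iterable, Set, Tuple


def diff_listings(previous: Set[str], current: Set[str]) -> Tuple[Set[str], Set[str]]:
    """Return (added, removed) addresses between two runs."""
    added = current - previous
    removed = previous - current
    return added, removed


def load_seen(path: Path) -> Set[str]:
    """Load the addresses stored by the previous run (empty on the first run)."""
    if not path.exists():
        return set()
    return set(json.loads(path.read_text(encoding="utf-8")))


def save_seen(path: Path, addresses: Iterable[str]) -> None:
    """Store the addresses of this run for the next comparison."""
    path.write_text(
        json.dumps(sorted(addresses), ensure_ascii=False), encoding="utf-8"
    )


if __name__ == "__main__":
    state_file = Path("seen_apartments.json")  # hypothetical state file
    previous = load_seen(state_file)
    # Placeholder: in practice this set comes from scraping the overview page.
    current = {"Nytorv 5,5 th. 1450 København K"}
    added, removed = diff_listings(previous, current)
    print("Download pictures only for:", sorted(added))
    print("Send a notice about removed:", sorted(removed))
    save_seen(state_file, current)
```

With this approach only the overview page has to be fetched on every run; the pictures are requested once per apartment.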

Kind regards

Nicklas Olsen

+63 998 010 8623

Skills: Data Mining, Data Processing, Excel, HTML, Web Scraping

About the Employer:
( 0 reviews ) Philippines

Project ID: #16009678
