I would like to build an XML sitemap generator in either Python or other language that is preferred by the developer for this type of project.
Here are the features I would like to have for the generator:
- Pattern-matching to check for parallel pages in other languages/locale
- Configurable patterns based on the site URLs
- Ability to configure to crawl only specific directories
- Ability to disallow from crawling parameters
- Cloud-based GUI to add URL and start crawl (I can host on my VPS)
- Menu to export crawl results, configure crawl URLs, disallow parameters, etc. (See Screaming Frog)
- Crawl large sites up to 1,000,000 pages
- Check http status codes and ensure all 200 status
- Only add https URLs to sitemap and if http, check https for a page to add
- Check box for adding video, images, etc. to sitemap either combined or separate sitemaps
- Build sitemap index for large sites and then link off to each associated locale sitemap
- Option to divide sitemap based on URL patterns (Have a certain directory or part of the site in each sitemap)
- No more than 50,000 URLs per sitemap
26 freelancers are bidding on average $511 for this job
Hi, How are you? I am really interested in your project. I have enough experience on python, XML. I am 100% sure i can satisfy your requirements perfectly. I want a long term relationship with you. Thanks.
Dear client, how have you been? I've read your project description carefully. Presenting this proposal with 100% confident. How about discuss in chat? Waiting for your reply. Best regards.