We want to develop a product content scraper that gathers product information from selected manufacturer websites:
- product images
- product PDFs
- product specifications
- marketing texts
- List of URLs
- List of basic product data, incl Manufacturer name, Manufacturer Part Number, Model name, EAN / UPC code.
The output will be structured (icecat xml dtd), and will be input for our PIM system.