Automatic pattern detection (automatic selection of repeating data) is supported only on the starting page of a configuration. To scrape repeating data (data in list or table format) from pages reached by following a link from the starting page, follow the steps below.


This is a two-stage process. In the first stage, collect the URLs of all details pages.

1. Open WebHarvy and load the starting page of the configuration.
2. Start Configuration.
3. Click and select the next page link.
4. Click the first listing link and select the Capture Target URL option.
5. Stop Configuration.
6. Start Mine.

At the end of the steps above, you will have a list of details page URLs. In the second stage, extract the data from those pages.

7. Load the first URL in the list in WebHarvy's browser.
8. Start Configuration.
9. First, click and select the repeating data displayed on the page, i.e., data displayed in table/list format.
10. When all repeating items have been selected from the page, click and select non-repeating items such as title, price, etc.
11. Click the URLs button within the Edit panel of the Configuration menu. Paste the remaining URLs obtained in Step 6 above, then Apply.
12. Stop Configuration.
13. Start Mine.
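
Steps 7 and 11 need the URL list from the first stage split into the first URL (loaded in the browser) and the remaining URLs (pasted into the URLs dialog). The following Python sketch shows one way to prepare that split outside WebHarvy. It is only an illustration: the function name split_url_list is hypothetical, the input is assumed to be plain text with one URL per line, and dropping blank lines and duplicate URLs is an added convenience, not something WebHarvy requires.

```python
def split_url_list(raw_text):
    """Split newline-separated URLs into (first_url, remaining_urls).

    Assumption: raw_text holds one URL per line, e.g. copied from the
    data mined in the first stage. Blank lines and repeated URLs are
    skipped; the original order is preserved.
    """
    seen = set()
    urls = []
    for line in raw_text.splitlines():
        url = line.strip()
        if not url or url in seen:
            continue  # ignore empty lines and duplicates
        seen.add(url)
        urls.append(url)
    if not urls:
        raise ValueError("no URLs found in input")
    return urls[0], urls[1:]


if __name__ == "__main__":
    # Placeholder URLs for illustration only.
    sample = "https://example.com/item/1\nhttps://example.com/item/2\nhttps://example.com/item/3\n"
    first, rest = split_url_list(sample)
    print("Load in browser (Step 7):", first)
    print("Paste into URLs dialog (Step 11):")
    print("\n".join(rest))
```

The first value is the URL to load in Step 7, and the second is the list to paste in Step 11.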


Reference: https://www.webharvy.com/articles/howto.html