During mining, some cells in the data table remain blank. This happens due to the following reasons:

    • 1. Data is actually not present in the page
    • 2. Page load failed / Page was not completely loaded
    • 3. Data is present, but WebHarvy was not able to locate it based on the configuration

Try the following steps to make sure that the page is completely loaded before extraction starts:

1. Increase the 'AJAX Load Wait Time' in Miner Settings. Websites which employ AJAX to load and display elements may require some additional time after page load to display all data. The default value of this setting is 5 seconds. Increase it to 10, 15 or 20 seconds and see if WebHarvy is able to get results during mining.

2. Soon after starting configuration, click anywhere on the page and select More Options > Scroll Page from the resulting Capture window. Then continue with the configuration. Similarly, after following links from the start page, once the new page is loaded, before selecting any data, click anywhere on the page and select More Options > Scroll Page. This helps to completely load all page elements before extraction is attempted.

3. Decrease the value of Maximum number of parallel mining threads setting in Advanced Miner Options. Set the value of this option to 1 and try mining again. If your system does not have adequate CPU/memory/bandwidth, then having a higher value for this setting can result in missing data.

Sometimes, the location of the text which you need to extract varies slightly from page to page. In such cases, during configuration, if you directly click on the required text and select it, during mining data will not be extracted for some pages where it occurs at a slightly different location. Try the following to overcome this problem:

1. If the required text always appears after a heading text, then use the Capture following text method to select it during configuration. This method works independent of the location of text.

2. Instead of clicking directly on the required text and selecting it, capture a larger area of text, and select the required portion from it by highlighting or by applying regular expressions.