-
Just double-checking that I understand the project's assumptions correctly. Is it only a matter of the site's response, or does Scrapegraph-ai additionally restrict access based on, e.g., robots.txt?
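For anyone who wants to check what a site's robots.txt permits, independently of whatever Scrapegraph-ai does internally (which this thread does not confirm), here is a minimal sketch using Python's standard `urllib.robotparser`. The helper name `is_allowed`, the `my-scraper` user agent, and the sample rules are made up for illustration.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt content, for illustration only.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

print(is_allowed(SAMPLE_ROBOTS, "my-scraper", "https://example.com/private/page"))
print(is_allowed(SAMPLE_ROBOTS, "my-scraper", "https://example.com/public/page"))
```

Against a live site you would instead call `set_url()` with the site's robots.txt address and then `read()` before `can_fetch()`.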
-
I think we can work around the robots.txt problem. Please write the code.
-
We suggest using this proxy service for proxy rotation: https://dashboard.statproxies.com/?refferal=scrapegraph
Hey @fx71, try setting the headless flag to False and you should be able to fetch the HTML. This sometimes happens with JavaScript-heavy websites.
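As a rough sketch of that suggestion, the flag can be set in the config dictionary passed to the graph. The `build_graph_config` helper and the model name are placeholders, and the exact config keys are an assumption based on the project's README-style examples, so check them against the version you have installed.

```python
def build_graph_config(headless: bool = False) -> dict:
    """Build a config dict with the browser's headless mode toggled."""
    return {
        # Placeholder LLM settings; replace with your own provider and model.
        "llm": {"model": "ollama/mistral"},
        # False opens a visible browser window, which can help on
        # JavaScript-heavy pages, as suggested above.
        "headless": headless,
    }


graph_config = build_graph_config(headless=False)
print(graph_config)

# Assumed usage (needs scrapegraphai installed; not executed here):
# from scrapegraphai.graphs import SmartScraperGraph
# graph = SmartScraperGraph(
#     prompt="List the page headings",
#     source="https://example.com",
#     config=graph_config,
# )
# result = graph.run()
```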