Writing a bot (crawler) for websites based on web forms
I need to expose a website based on web forms as an API. When a request is made to my API, it will navigate the website and do the job under the hood. That is pretty easy for a standard website.
But with web forms, you know, there are many view state and action state form parameters that mean nothing to a human. I could not figure out what is going on under the hood; even the navigation buttons submit 100+ form elements with seemingly random content.
Browser-based automation tools (e.g. Selenium) are the easiest option, but there will be hundreds of requests at the same time, and I do not want to launch 100+ browsers on a server.
I have tried Mechanize.Net, but it did not help, because I could not figure out how navigation works on the website.
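For context, my working assumption (which may be wrong) is that the site is ASP.NET Web Forms, where each page carries hidden inputs such as __VIEWSTATE and __EVENTVALIDATION, and a navigation link posts the whole form back with __EVENTTARGET set to the control that was clicked. Below is a minimal Python sketch of that idea using only the standard library. The names FormFieldParser and build_postback, the sample HTML, and the control names in it are mine and purely illustrative; they do not come from the real site or from any library. It only handles input elements, not select or textarea.

```python
from html.parser import HTMLParser


class FormFieldParser(HTMLParser):
    """Collects the name/value pairs a browser would submit for <input> tags."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag != "input":
            return
        attr = dict(attrs)
        name = attr.get("name")
        if not name:
            return
        input_type = (attr.get("type") or "text").lower()
        # Buttons are only submitted when they are the one being clicked.
        if input_type in ("submit", "button", "image", "reset"):
            return
        if input_type in ("checkbox", "radio"):
            # Unchecked boxes are not submitted at all.
            if "checked" not in attr:
                return
            value = attr.get("value")
            self.fields[name] = "on" if value is None else value
            return
        self.fields[name] = attr.get("value") or ""


def build_postback(html, event_target, event_argument=""):
    """Echo every field from the last response and name the clicked control."""
    parser = FormFieldParser()
    parser.feed(html)
    payload = dict(parser.fields)
    payload["__EVENTTARGET"] = event_target
    payload["__EVENTARGUMENT"] = event_argument
    return payload


# Made-up page fragment standing in for a real response body.
sample = """<form method="post" action="./List.aspx">
<input type="hidden" name="__VIEWSTATE" value="dDwtMTA4" />
<input type="hidden" name="__EVENTVALIDATION" value="/wEWAg" />
<input type="text" name="ctl00$txtSearch" value="" />
<input type="submit" name="ctl00$btnGo" value="Go" />
</form>"""

payload = build_postback(sample, "ctl00$lnkNext")
```

If this assumption holds, the payload would be sent as a form-encoded POST to the form's action URL with the same cookies as the previous request, and the hidden fields would have to be re-read from every response, since they change from page to page.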
Any thoughts?