我正在尝试抓取其中包含标记的网址,但它返回400错误的请求错误。
有没有办法在PHP中使用CURL正确解析这个?
I'm trying to crawl a url that has has tag in it but it returns 400 bad request error.
Is there a way to parse this correctly using CURL in PHP?
最满意答案
锚片段( #和后面的所有内容)不是请求URL的一部分。 它永远不应该发送到服务器。 cURL与此无关。 只有浏览器关心这部分。
如果此页面正在执行某些AJAX或以其他方式基于锚片段动态加载内容,那么您将需要一个浏览器来运行该页面并执行JavaScript。 您可以使用PhantomJS执行此任务。
The anchor fragment (# and everything after) is not part of the request URL. It should never be sent to the server. cURL has nothing to do with this. Only the browser cares about this part.
If this page is doing some AJAX or otherwise dynamically loading content based on the anchor fragment, then you will need a browser to run that page and execute the JavaScript. You can use PhantomJS for this task.
更多推荐
发布评论