MANDY: Fetch a page with cookies using Python requests library

Sunday, 29 September 2013

Fetch a page with cookies using Python requests library

Fetch a page with cookies using Python requests library

I'm just studying the requests
library(http://docs.python-requests.org/en/latest/), and got a problem on
how to fetch a page with cookies using requests.
for example:
url2= 'https://passport.baidu.com'
parsedCookies={'PTOKEN': '412f...', 'BDUSS': 'hnN2...', ...} #Sorry that
the cookies value is replaced by ... for instance of privacy
req = requests.get(url2, cookies=parsedCookies)
text=req.text.encode('utf-8','ignore')
f=open('before.html','w')
f.write(text)
f.close()
req.close()
when I use the codes above to fetch the page, it redirects to the login
page, also I see the login page in 'before.html', it refers that actually
I haven't logged in successfully.
But if I use URLlib2 to fetch the page, it works properly as expected.
parsedCookies="PTOKEN=412f...;BDUSS=hnN2...;..." #Different format but
same content with the aboved cookies
req = urllib2.Request(url2)
req.add_header('Cookie', parsedCookies)
ret = urllib2.urlopen(req)
f=open('before_urllib2.html','w')
f.write(ret.read())
f.close()
ret.close()
When I use these codes, it saves the login page in before_urllib2.html.
--
Are there any mistakes in my code?

MANDY

Sunday, 29 September 2013

Fetch a page with cookies using Python requests library

No comments:

Post a Comment