Organizational Research By

Surprising Reserch Topic

Question:Is there a way to crawl all facebook fan pages?


Is there a way to crawl all facebook fan pages and collect some information? like for example crawling facebook fan pages and save their names, or how many fans, etc? Or at least, do you have a hint of how this could be possibly done?

asked Sep 13, 2013 in Crawl by rajesh
edited Sep 12, 2013
0 votes
45 views



Related Hot Questions

4 Answers

0 votes
Write a crawler.

    I used Coca-Cola's page as an experiment: http://www.facebook.com/cocacola?v=wall

    Parse out the "Fans" div, which contains an "All Fans" link. View source in your web browser, it looks like this: /social_graph.php?node_id=40796308305&class=FanManager

    Turn that into a facebook URL and crawl it: http://www.facebook.com/social_graph.php?node_id=40796308305&class=FanManager

    Parse out the fans, then parse out the "Next page" link.

    Repeat, ad nauseum.

    Throttle your requests so facebook doesn't blacklist you.
answered Sep 13, 2013 by rajesh
edited Sep 12, 2013
0 votes
First select a page that contains your desired category for pages:

For Example: http://www.facebook.com/pages/ or http://www.facebook.com/pages/?browse&ps=93

Then use a crawler to get all pages links.

Now you can parse each page separately using extracted links.

You can use simple html dom for crawling.
answered Sep 13, 2013 by rajesh
edited Sep 12, 2013
0 votes
Write a crawler.

    I used Coca-Cola's page as an experiment: http://www.facebook.com/cocacola?v=wall

    Parse out the "Fans" div, which contains an "All Fans" link. View source in your web browser, it looks like this: /social_graph.php?node_id=40796308305&class=FanManager

    Turn that into a facebook URL and crawl it: http://www.facebook.com/social_graph.php?node_id=40796308305&class=FanManager

    Parse out the fans, then parse out the "Next page" link.

    Repeat, ad nauseum.

    Throttle your requests so facebook doesn't blacklist you.
answered Sep 13, 2013 by rajesh
edited Sep 12, 2013
0 votes
First select a page that contains your desired category for pages:

For Example: http://www.facebook.com/pages/ or http://www.facebook.com/pages/?browse&ps=93

Then use a crawler to get all pages links.

Now you can parse each page separately using extracted links.

You can use simple html dom for crawling.
answered Sep 13, 2013 by rajesh
edited Sep 12, 2013

...