Uncategorized
444 view

[node.js]what libraries we can use for crawling website : cheerio

I’ll use libraries which are “request” and “cheerio”.

npm install request
npm install cheerio

“request” allows us to get data from target URL.
And then, “cheerio” allows us to analyse the retrieved data with DOM. Sample is following.

#!/usr/bin/env node

var request = require("request");
var cheerio = require("cheerio");

var request_url = "http://www.google.com";

request({url: request_url}, function(error, response, body)
{
  if (!error &amp;amp;amp;&amp;amp;amp; response.statusCode == 200) {
    $ = cheerio.load(body);

    var url = response.request.href;
    var title = $("title").text();

    console.log(url);
    console.log(title);
  } else {
    console.log(response.statusCode);
  }
});

Author: zuqqhi2
Uncategorized

[express][socket.io]Check it out that ch…Prev post

[node.js]Insert restaurant information f…Next post

[node.js]what libraries we can use for crawling website : cheerio

Uncategorized recent post

Run Amazon FreeRTOS on M5Stack Core2 for AWS …

Udacity Self-Driving Car Engineer Nanodegree …

Install sbt 1.0.0 and run sample template

Visualization of Neural Network and its Train…

[Machine Learning]Created docker image includ…

関連記事

Install Apache Hadoop

[Algorithm]Solve Maze By Dept…

[Ruby]Retrieve English tweets …

[memo]granpark.rb security mee…

[Test][CoffeeScript]Measure co…

[Backbonjs][Javascript]Install…