JAVA HttpClient模拟网页登录抓取网页数据

目前正在学习安卓,并开发了一个类似于超级课程表和今日校园的APP,但是一直卡壳在抓取课程表这一步,遍历了很多资料任然无法解决,下定决心系统信息HttpClient,先写一个helloWord,一直持续记录学习!

一、使用org.apache.HttpClient模拟登录网页,并抓取数据,需要用到HttpClient包

import org.apache.http.HttpEntity;
import org.apache.http.client.methods.CloseableHttpResponse;
import org.apache.http.client.methods.HttpGet;
import org.apache.http.impl.client.CloseableHttpClient;
import org.apache.http.impl.client.HttpClients;
import org.apache.http.util.EntityUtils;

import java.io.IOException;

public class HellWord {
    //直接模拟
    public static void main(String[] a){
        //生成一个可关闭的HTTP浏览器(相当于)
        CloseableHttpClient httpClient=HttpClients.createDefault();
        CloseableHttpResponse response=null;
        //创建http Get请求
        HttpGet httpGet=new HttpGet("http://hll520.cn");
        try {
            response=httpClient.execute(httpGet);//执行
        } catch (IOException e) {
            e.printStackTrace();
        }

        //获取网页源码
        HttpEntity httpEntity=response.getEntity();//获取网页源码
        try {
            String h=EntityUtils.toString(httpEntity,"UTF-8");//指定编码避免乱码
            System.out.printf(h);
        } catch (IOException e) {
            //io异常(网络问题)
            e.printStackTrace();
        }

        //关闭HTTp
        try {
            response.close();
            httpClient.close();
        } catch (IOException e) {
            e.printStackTrace();
        }

    }
}

二、运行结果,模拟打开一个网页,并使用getEntity将网页HTML源码显示

发布了2 篇原创文章 · 获赞 0 · 访问量 32

猜你喜欢

转载自blog.csdn.net/XiaoYunKuaiFei/article/details/105409296
今日推荐