当块到达时,如何使用java.net.http阅读和打印块HTTP响应?

时间:2018-10-05 16:22:42

标签: java http reactive java-http-client

Java 11引入了一个新的软件包java.net.http,用于发出HTTP请求。对于一般用途,这很简单。

我的问题是:当客户端接收到每个块时,如何使用java.net.http处理块响应?

java.http.net包含一个响应式BodySubscriber,它似乎是我想要的,但是我找不到如何使用它的示例。

http_get_demo.py

下面是一个python实现,它在到达块时将其打印出来,我也想在java.net.http上做同样的事情:

import argparse
import requests


def main(url: str):
    with requests.get(url, stream=True) as r:
        for c in r.iter_content(chunk_size=1):
            print(c.decode("UTF-8"), end="")


if __name__ == "__main__":
    parser = argparse.ArgumentParser(
        description="Read from a URL and print as text as chunks arrive")
    parser.add_argument('url', type=str, help="A URL to read from")
    args = parser.parse_args()

    main(args.url)

HttpGetDemo.java

仅出于完整性考虑,这是一个使用java.net.http发出阻止请求的简单示例:

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpResponse;
import java.net.http.HttpRequest;

public class HttpGetDemo {

  public static void main(String[] args) throws Exception {

    var request = HttpRequest.newBuilder()
            .uri(URI.create(args[0]))
            .build();

    var bodyHandler = HttpResponse.BodyHandlers
            .ofString();

    var client = HttpClient.newHttpClient();
    var response = client.send(request, bodyHandler);
    System.out.println(response.body());

  }
}

HttpAsyncGetDemo.java

这是发出非阻塞/异步请求的示例:

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpResponse;
import java.net.http.HttpRequest;

/**
 * ReadChunked
 */
public class HttpAsyncGetDemo {

  public static void main(String[] args) throws Exception {

    var request = HttpRequest.newBuilder()
            .uri(URI.create(args[0]))
            .build();

    var bodyHandler = HttpResponse.BodyHandlers
            .ofString();

    var client = HttpClient.newHttpClient();

    client.sendAsync(request, bodyHandler)
            .thenApply(HttpResponse::body)
            .thenAccept(System.out::println)
            .join();

  }
}

4 个答案:

答案 0 :(得分:2)

您可以打印[jenkins@symphony ~]$ aws ecr get-login --no-include-email --region us-east-2 ,但不能保证ByteBuffer对应一个块。块由堆栈处理。将为每个块推送一个ByteBuffer切片-但是,如果缓冲区中没有足够的空间,则将推送部分块。消费者所看到的只是包含数据的ByteBuffer流。 因此,您可以做的是在打印ByteBuffer时将它们打印出来,但是您不能保证它们与服务器发送的每一个都完全对应一个块。

注意:如果请求的主体是基于文本的,则可以使用 ByteBuffer和自定义BodyHandlers.fromLineSubscriber(Subscriber<? super String> subscriber),将在出现时自动打印每一行。 Subscriber<String>用响应头中指示的字符集将字节解码为字符,这是硬道理,如果需要,则将字节缓冲直到可以解码为止(如果文本包含字符,则ByteBuffer可能会在编码序列的中间结束编码为多个字节),然后在行边界处将其分割。对于文本中的每一行,Subscriber :: onNext方法将被调用一次。有关更多信息,请参见https://download.java.net/java/early_access/jdk11/docs/api/java.net.http/java/net/http/HttpResponse.BodyHandlers.html#fromLineSubscriber(java.util.concurrent.Flow.Subscriber)

答案 1 :(得分:2)

python代码不能确保每次提供一个HTTP chunk响应主体数据。它仅向应用程序提供少量数据,从而减少了应用程序级别消耗的内存量(可以将其缓存在堆栈中的较低位置)。 Java 11 HTTP客户端支持通过流主体处理程序之一HttpResponse.BodyHandlersofInputStreamofByteArrayConsumerasLines等进行流传输。

或编写您自己的处理程序/订户,如下所示:   https://www.youtube.com/watch?v=qiaC0QMLz5Y

答案 2 :(得分:0)

感谢@pavel和@ chegar999的部分答案。他们带我找到了解决方案。

概述

我想出的解决方案如下。基本上,解决方案是使用自定义java.net.http.HttpResponse.BodySubscriber。 BodySubscriber包含反应性方法(onSubscribe,onNext,onError和onComplete)和一个getBody方法,该方法基本上返回一个Java CompletableFuture,最终将产生HTTP请求的主体。掌握好BodySubscriber之后,您可以像使用它一样

HttpClient client = HttpClient.newHttpClient();
HttpRequest request = HttpRequest.newBuilder()
    .uri(URI.create(uri))
    .build();

return client.sendAsync(request, responseInfo -> new StringSubscriber())
    .whenComplete((r, t) -> System.out.println("--- Status code " + r.statusCode()))
    .thenApply(HttpResponse::body);

请注意以下行:

client.sendAsync(request, responseInfo -> new StringSubscriber())

这是我们注册自定义BodySubscriber的地方;在这种情况下,我的自定义类名为StringSubscriber

CustomSubscriber.java

这是一个完整的工作示例。使用Java 11,您无需编译即可运行它。将其粘贴到名为CustomSubscriber.java的文件中,然后运行命令java CustomSubscriber <some url>。它在每个块到达时打印其内容。它还会收集它们,并在响应完成后将它们作为正文返回。

import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.net.http.HttpResponse.BodyHandlers;
import java.net.http.HttpResponse.BodySubscriber;
import java.net.URI;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionStage;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.Flow;
import java.util.stream.Collectors;
import java.util.List;

public class CustomSubscriber {

  public static void main(String[] args) {
    CustomSubscriber cs = new CustomSubscriber();
    String body = cs.get(args[0]).join();
    System.out.println("--- Response body:\n: ..." + body + "...");
  }

  public CompletableFuture<String> get(String uri) {
    HttpClient client = HttpClient.newHttpClient();
    HttpRequest request = HttpRequest.newBuilder()
        .uri(URI.create(uri))
        .build();

    return client.sendAsync(request, responseInfo -> new StringSubscriber())
        .whenComplete((r, t) -> System.out.println("--- Status code " + r.statusCode()))
        .thenApply(HttpResponse::body);
  }

  static class StringSubscriber implements BodySubscriber<String> {

    final CompletableFuture<String> bodyCF = new CompletableFuture<>();
    Flow.Subscription subscription;
    List<ByteBuffer> responseData = new CopyOnWriteArrayList<>();

    @Override
    public CompletionStage<String> getBody() {
      return bodyCF;
    }

    @Override
    public void onSubscribe(Flow.Subscription subscription) {
      this.subscription = subscription;
      subscription.request(1); // Request first item
    }

    @Override
    public void onNext(List<ByteBuffer> buffers) {
      System.out.println("-- onNext " + buffers);
      try {
        System.out.println("\tBuffer Content:\n" + asString(buffers));
      } 
      catch (Exception e) {
        System.out.println("\tUnable to print buffer content");
      }
      buffers.forEach(ByteBuffer::rewind); // Rewind after reading
      responseData.addAll(buffers);
      subscription.request(1); // Request next item
    }

    @Override
    public void onError(Throwable throwable) {
      bodyCF.completeExceptionally(throwable);
    }

    @Override
    public void onComplete() {
      bodyCF.complete(asString(responseData));
    }

    private String asString(List<ByteBuffer> buffers) {
      return new String(toBytes(buffers), StandardCharsets.UTF_8);
    }

    private byte[] toBytes(List<ByteBuffer> buffers) {
      int size = buffers.stream()
          .mapToInt(ByteBuffer::remaining)
          .sum();
      byte[] bs = new byte[size];
      int offset = 0;
      for (ByteBuffer buffer : buffers) {
        int remaining = buffer.remaining();
        buffer.get(bs, offset, remaining);
        offset += remaining;
      }
      return bs;
    }

  }
}

尝试一下

要测试此解决方案,您需要一台服务器,该服务器发送使用Transfer-encoding: chunked的响应,并以足够慢的速度发送响应,以观看块的到达。我已经在https://github.com/hohonuuli/demo-chunk-server创建了一个,但是您可以像这样使用Docker来启动它:

docker run -p 8080:8080 hohonuuli/demo-chunk-server

然后使用java CustomSubscriber.java http://localhost:8080/chunk/10

运行CustomSubscriber.java代码

答案 3 :(得分:0)

现在有一个新的Java库可以满足这种要求 RxSON:https://github.com/rxson/rxson 它利用JsonPath和RxJava尽快从响应中读取JSON流数据块,并将其解析为Java对象。

示例:

String serviceURL = "https://think.cs.vt.edu/corgis/datasets/json/airlines/airlines.json";
   HttpRequest req = HttpRequest.newBuilder(URI.create(serviceURL)).GET().build();
   RxSON rxson = new RxSON.Builder().build();

   String jsonPath = "$[*].Airport.Name";
   Flowable<String> airportStream = rxson.create(String.class, req, jsonPath);
   airportStream
       .doOnNext(it -> System.out.println("Received new item: " + it))
       //Just for test
       .toList()
       .blockingGet();