最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

java - Efficiently build a large array with MethodHandles - Stack Overflow

programmeradmin1浏览0评论

I have a need to construct a MethodHandle with signature (InternalContext)->Object[] from a List<MethodHandle> each with the signature (InternalContext)->Object.

If the list is small <253 elements I can do something like

  static MethodHandle makeArrayUsingCollector(List<MethodHandle> elements) {
    // (Object[]) -> Object[]
    var handle = MethodHandles.identity(Object[].class);
    // (Object,Object...Object) -> Object[]
    handle = handle.asCollector(Object[].class, elements.size());
    // (InternalContext,InternalContext..InternalContext) -> Object[]
    handle = MethodHandles.filterArguments(handle, 0, elements.toArray(new MethodHandle[0]));
    // (InternalContext) -> Object[]
    handle = MethodHandles.permuteArguments(
        handle, methodType(Object[].class, InternalContext.class), new int[elements.size()]);
    return handle;
  }

However, if the list is larger the asCollector() call will fail since the resulting handle will have too many parameters.

So I tried something like this:

  private static final int MAX_ARITY = 255;

  static MethodHandle buildLargeArrayNaive(List<MethodHandle> elementFactories) {
    if (elementFactories.size() < MAX_ARITY) {
      return makeArrayUsingCollector(elementFactories);
    }
    var setter = MethodHandles.arrayElementSetter(Object[].class);
    // (Object[], InternalContext) -> void
    MethodHandle handle = null;
    for (int i = 0; i < elementFactories.size(); i++) {
      // (Object[], InternalContext) -> void
      var setElement =
          MethodHandles.filterArguments(
              MethodHandles.insertArguments(setter, 1, i), 1, elementFactories.get(i));
      if (handle == null) {
        handle = setElement;
      } else {
        handle = MethodHandles.foldArguments(setElement, handle);
      }
    }

    // (Object[], InternalContext) -> Object[]
    handle =
        MethodHandles.foldArguments(
            MethodHandles.dropArguments(
                MethodHandles.identity(Object[].class), 1, InternalContext.class),
            handle);
    return MethodHandles.foldArguments(
        handle,
        MethodHandles.insertArguments(
            MethodHandles.arrayConstructor(Object[].class), 0, elementFactories.size()));
  }

Which basically constructs an array explicitly and then sets each element combining everything with foldArguments

This works, but the resulting MethodHandle produces StackOverflowError when constructing large arrays (say 100K elements). From the docs on foldArguments this makes sense since each call sets up a 'tail call' structure.

If i was generating literal bytecode to do this I would probably generate a method with as many aastore instructions as could fit and then set up a kind of 'tree' structure to call all these methods.

static makeArray(InternalContext ctx) {
  var a = new Object[<num-elements];
  fillArray0(a, context)
  fillArray1(a, context)
  ...
  return a;
}
static void fillArray0(Object[] a, InternalContext context) {
  a[0] = <element-factory>(context);
  ...
}

I did discover i could solve this with recursion

 static MethodHandle buildLargeArrayRecursive(List<MethodHandle> elementFactories) {
    // (Object[], InternalContext) -> void
    var handle = doBuildLargeArrayRecursive(0, elementFactories);
    // (Object[], InternalContext) -> Object[]
    handle =
        MethodHandles.foldArguments(
            MethodHandles.dropArguments(
                MethodHandles.identity(Object[].class), 1, InternalContext.class),
            handle);
    ;
    return MethodHandles.foldArguments(
        handle,
        MethodHandles.insertArguments(
            MethodHandles.arrayConstructor(Object[].class), 0, elementFactories.size()));
  }

  static final MethodHandle ARRAY_SETTER = MethodHandles.arrayElementSetter(Object[].class);

  static MethodHandle doBuildLargeArrayRecursive(int offset, List<MethodHandle> elementFactories) {
    int size = elementFactories.size();
    if (size == 0) {
      return MethodHandles.empty(methodType(void.class, Object[].class, InternalContext.class));
    }
    if (size == 1) {
      return MethodHandles.filterArguments(
          MethodHandles.insertArguments(ARRAY_SETTER, 1, offset), 1, elementFactories.get(0));
    }
    int half = size / 2;
    var left = elementFactories.subList(0, half);
    var right = elementFactories.subList(half, size);
    return MethodHandles.foldArguments(
        doBuildLargeArrayRecursive(offset + half, right), doBuildLargeArrayRecursive(offset, left));
  }

which seems to scale to at least 100K elements. But clearly this will consume O(logN) stack space.. that is probably fine but this is also a lot of method calls...

A few things occurred while staring at this:

  • Should i create helper 'bulk-set' methods to handle more base cases? e.g. static void set4(Object[], int offset, Object o1...Object o3){...} could maybe help reduce recursion? Or should i just rely on the VM to do this via inlining?

  • I see that Setting array elements is an 'intrinsic' operation that compiles to an aastore instruction, is there some way to combine things so that an arbitrary number of those could be in the same LambdaForm?

  • Would it be better to construct small arrays and patch them together using System.arraycopy?

发布评论

评论列表(0)

  1. 暂无评论