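How do I get an int[] from an iterator?

Asked: 2009-07-22 17:55:46

Tags: java collections

I have what amounts to an Iterator<Integer>... actually it's a class Thing that accepts a Visitor<SomeObject> and calls visit() for a subset of the SomeObjects it contains, and I have to implement Visitor<SomeObject> so that it does something like this: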

// somehow get all the Id's from each of the SomeObject that Thing lets me visit
public int[] myIdExtractor(Thing thing)
{
    // must be final so the anonymous Visitor below can capture it
    final SomeCollection c = new SomeCollection();
    thing.visitObjects(new Visitor<SomeObject>()
         {
              public void visit(SomeObject obj) { c.add(obj.getId()); }
         }
    );
    return convertToPrimitiveArray(c);
}

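I need to extract an int[] containing the results, and I'm not sure what to use for SomeCollection and convertToPrimitiveArray. The number of results is not known in advance and will be large (10K-500K). Is there a better choice than using ArrayList<Integer> for SomeCollection, together with this: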

public int[] convertToPrimitiveArray(List<Integer> ints)
{
    int N = ints.size();
    int[] array = new int[N];
    int j = 0;
    for (Integer i : ints)
    {
        array[j++] = i;
    }
    return array;
}

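Efficiency and memory usage are a concern.

4 answers:

Answer 0 (score: 2)

It is not too difficult to come up with a class that collects ints in an array (even if you are not using a library that does this for you).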

public class IntBuffer {
    private int[] values = new int[10];
    private int size = 0;
    public void add(int value) {
        if (!(size < values.length)) {
            values = java.util.Arrays.copyOf(values, values.length*2);
        }
        values[size++] = value;
    }
    public int[] toArray() {
        return java.util.Arrays.copyOf(values, size);
    }
}

(Disclaimer: this is stackoverflow, I have not even attempted to compile this code.)
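To show how such a buffer plugs into the visitor from the question, here is a minimal sketch. The Visitor, SomeObject and Thing types below are hypothetical stand-ins (the question does not show the real ones), and the buffer is renamed GrowableIntArray only to avoid confusion with java.nio.IntBuffer.

```java
import java.util.Arrays;

// Hypothetical stand-ins for the question's types; the real Thing,
// SomeObject and Visitor are not shown in the question.
interface Visitor<T> { void visit(T obj); }

final class SomeObject {
    private final int id;
    SomeObject(int id) { this.id = id; }
    int getId() { return id; }
}

final class Thing {
    private final SomeObject[] objects;
    Thing(SomeObject... objects) { this.objects = objects; }
    // Calls the visitor once per contained object, in order.
    void visitObjects(Visitor<SomeObject> v) {
        for (SomeObject o : objects) v.visit(o);
    }
}

// Same doubling strategy as the IntBuffer class above.
final class GrowableIntArray {
    private int[] values = new int[10];
    private int size = 0;

    void add(int value) {
        if (size == values.length) {
            values = Arrays.copyOf(values, values.length * 2);
        }
        values[size++] = value;
    }

    // Returns a copy trimmed to the number of values actually added.
    int[] toArray() { return Arrays.copyOf(values, size); }
}

final class IdExtractor {
    static int[] myIdExtractor(Thing thing) {
        final GrowableIntArray ids = new GrowableIntArray();
        thing.visitObjects(new Visitor<SomeObject>() {
            public void visit(SomeObject obj) { ids.add(obj.getId()); }
        });
        return ids.toArray();
    }

    public static void main(String[] args) {
        Thing thing = new Thing(new SomeObject(7), new SomeObject(11), new SomeObject(13));
        System.out.println(Arrays.toString(myIdExtractor(thing))); // prints [7, 11, 13]
    }
}
```

No Integer objects are created along the way, so the only extra cost is the occasional array copy when the buffer doubles, plus one final copy in toArray().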

As an alternative, you can use a DataOutputStream to store the ints in a ByteArrayOutputStream.

final ByteArrayOutputStream byteOut = new ByteArrayOutputStream();
final DataOutputStream out = new DataOutputStream(byteOut);
...
    out.writeInt(value);
...
out.flush();
final byte[] bytes = byteOut.toByteArray();
final int[] ints = new int[bytes.length/4];
final ByteArrayInputStream byteIn = new ByteArrayInputStream(bytes);
final DataInputStream in = new DataInputStream(byteIn);
for (int ct=0; ct<ints.length; ++ct) {
    ints[ct] = in.readInt();
}

(Disclaimer: this is stackoverflow, I have not even attempted to compile this code.)
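The stream idea above can be filled out into a small self-contained class, shown here as a sketch rather than a tuned implementation. The name StreamIntCollector and its add/toArray methods are made up for illustration. The checked IOException is rethrown as an unchecked exception, on the assumption that writing to an in-memory ByteArrayOutputStream should not fail in practice.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.Arrays;

// Hypothetical helper: collects ints as bytes, then decodes them into an int[].
final class StreamIntCollector {
    private final ByteArrayOutputStream byteOut = new ByteArrayOutputStream();
    private final DataOutputStream out = new DataOutputStream(byteOut);

    void add(int value) {
        try {
            out.writeInt(value); // writes 4 bytes, high byte first
        } catch (IOException e) {
            // Not expected for an in-memory stream; surface it instead of swallowing it.
            throw new IllegalStateException(e);
        }
    }

    int[] toArray() {
        try {
            out.flush();
            byte[] bytes = byteOut.toByteArray();
            int[] ints = new int[bytes.length / 4];
            DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes));
            for (int i = 0; i < ints.length; i++) {
                ints[i] = in.readInt();
            }
            return ints;
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        StreamIntCollector c = new StreamIntCollector();
        c.add(1);
        c.add(-2);
        c.add(Integer.MAX_VALUE);
        System.out.println(Arrays.toString(c.toArray())); // prints [1, -2, 2147483647]
    }
}
```

Note that toByteArray() copies the whole buffer before decoding, so at the moment of conversion this approach holds the byte buffer, its copy, and the resulting int[] in memory at once.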

Answer 1 (score: 1)

You could look at something like pjc to handle this. It is a collections framework made for primitives.

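Answer 2 (score: 0)

For the sake of benchmarking, I put together a test program that uses an LFSR generator to keep the compiler from optimizing away the test arrays. I couldn't download pjc, but I assume its timing would be similar to Tom's IntBuffer class, which is by far the winner. The ByteArrayOutputStream approach is about the same speed as my original ArrayList<Integer> approach. I'm running J2SE 6u13 on a 3GHz Pentium 4, and with roughly 2^20 values, after the JIT has run its course, the IntBuffer approach takes roughly 40 msec (only 40 nsec per item!) compared with a reference implementation that uses a "forgetful" collection, which just stores the last argument to visit() (so the compiler doesn't optimize it out). The other two approaches take on the order of 300 msec, about 8x as slow.

Edit: I suspect the problem with the Stream approach is the potential for exceptions, which I had to catch, but I'm not sure.

(To run, pass the arguments like this: PrimitiveArrayTest 1 2)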

package com.example.test.collections;

import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class PrimitiveArrayTest {
    interface SomeObject {
        public int getX();
    }
    interface Visitor {
        public void visit(SomeObject obj);
    }

    public static class PlainObject implements SomeObject
    {
        private int x;
        public int getX() { return this.x; }
        public void setX(int x) { this.x = x; }  
    }

    public static class Thing
    {
        /* here's a LFSR        
         * see http://en.wikipedia.org/wiki/Linear_feedback_shift_register
         * and http://www.ece.cmu.edu/~koopman/lfsr/index.html
         */
        private int state;
        final static private int MASK = 0x80004;
        private void _next()
        {
            this.state = (this.state >>> 1) 
            ^ (-(this.state & 1) & MASK);
        }
        public Thing(int state) { this.state = state; }     
        public void setState(int state) { this.state = state; }

        public void inviteVisitor(Visitor v, int terminationPoint)
        {
            PlainObject obj = new PlainObject();
            while (this.state != terminationPoint)
            {
                obj.setX(this.state);
                v.visit(obj);
                _next();
            }
        }
    }

    static public abstract class Collector implements Visitor
    {
        abstract public void initCollection();
        abstract public int[] getCollection();
        public int[] extractX(Thing thing, int startState, int endState)
        {
            initCollection();
            thing.setState(startState);
            thing.inviteVisitor(this, endState);
            return getCollection();
        }
        public void doit(Thing thing, int startState, int endState)
        {
            System.out.printf("%s.doit(thing,%d,%d):\n",
                    getClass().getName(),
                    startState,
                    endState);
            long l1 = System.nanoTime();
            int[] result = extractX(thing,startState,endState);
            long l2 = System.nanoTime();
            StringBuilder sb = new StringBuilder();
            sb.append(String.format("%d values calculated in %.4f msec ",
                    result.length, (l2-l1)*1e-6));
            int N = 3;
            if (result.length <= 2*N)
            {
                sb.append("[");
                for (int i = 0; i < result.length; ++i)
                {
                    if (i > 0)
                        sb.append(", ");
                    sb.append(result[i]);
                }
                sb.append("]");
            }
            else
            {
                int sz = result.length;
                sb.append(String.format("[%d, %d, %d... %d, %d, %d]",
                        result[0], result[1], result[2], 
                        result[sz-3], result[sz-2], result[sz-1]));
            }
            System.out.println(sb.toString());          
        }
    }

    static public class Collector0 extends Collector
    {
        int lastint = 0;
        @Override public int[] getCollection() { return new int[]{lastint}; }
        @Override public void initCollection() {}
        @Override public void visit(SomeObject obj) {lastint = obj.getX(); }
    }
    static public class Collector1 extends Collector
    {
        final private List<Integer> ints = new ArrayList<Integer>();

        @Override public int[] getCollection() {
            int N = this.ints.size();
            int[] array = new int[N];
            int j = 0;
            for (Integer i : this.ints)
            {
                array[j++] = i;
            }
            return array;           
        }
        @Override public void initCollection() { }
        @Override public void visit(SomeObject obj) { ints.add(obj.getX()); }
    }

    static public class Collector2 extends Collector
    {
        /*
         * adapted from http://stackoverflow.com/questions/1167060
         * by Tom Hawtin
         */
        private int[] values;
        private int size = 0;
        @Override public void visit(SomeObject obj) { add(obj.getX()); }
        @Override public void initCollection() { values = new int[32]; }
        private void add(int value) {
            if (!(this.size < this.values.length)) {
                this.values = java.util.Arrays.copyOf(
                        this.values, this.values.length*2);
            }
            this.values[this.size++] = value;
        }
        @Override public int[] getCollection() {
            return java.util.Arrays.copyOf(this.values, this.size);
        }       
    }

    static public class Collector3 extends Collector
    {
        /*
         * adapted from http://stackoverflow.com/questions/1167060
         * by Tom Hawtin
         */
        final ByteArrayOutputStream byteOut = new ByteArrayOutputStream();
        final DataOutputStream out = new DataOutputStream(this.byteOut);
        int size = 0;
        @Override public int[] getCollection()  {
            try
            {
                this.out.flush();
                final int[] ints = new int[this.size];
                final ByteArrayInputStream byteIn 
                    = new ByteArrayInputStream(this.byteOut.toByteArray());
                final DataInputStream in = new DataInputStream(byteIn);

                for (int ct=0; ct<ints.length; ++ct) {
                    ints[ct] = in.readInt();
                }
                return ints;
            }
            catch (IOException e) { /* gulp */ }

            return new int[0]; // failure!?!??!
        }

        @Override public void initCollection() { }
        @Override public void visit(SomeObject obj) {
            try {
                this.out.writeInt(obj.getX());
                ++this.size;
            }
            catch (IOException e) { /* gulp */ }
        }       
    }
    public static void main(String args[])
    {
        int startState = Integer.parseInt(args[0]);
        int endState = Integer.parseInt(args[1]);
        Thing thing = new Thing(0);
        // let JIT do its thing
        for (int i = 0; i < 20; ++i)
        {
            Collector[] collectors = {new Collector0(), new Collector1(), new Collector2(), new Collector3()};
            for (Collector c : collectors)
            {
                c.doit(thing, startState, endState);
            }
            System.out.println();
        }
    }
}

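Answer 3 (score: -4)

You can use List.toArray(T[] a) to copy the list into an array. Generics cannot take primitive type arguments, so this yields an Integer[] rather than an int[], and a loop like convertToPrimitiveArray is still needed to unbox the values:

List<Integer> al = new ArrayList<Integer>();
// populate al
Integer[] values = al.toArray(new Integer[al.size()]);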

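As for your other question, LinkedList might be slightly better than ArrayList, since you don't know the size of the result set in advance.

If performance really is a concern, you are probably better off managing an int[] by hand and using System.arraycopy() each time it grows; the boxing/unboxing between int and Integer that any Collection requires could hurt.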
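A minimal sketch of that hand-managed approach, using System.arraycopy() for both growth and the final trim. The class name ManualIntArray, the initial-capacity parameter and the roughly 1.5x growth factor are all arbitrary choices made for this illustration.

```java
// Hypothetical helper: a hand-managed int[] that grows via System.arraycopy().
final class ManualIntArray {
    private int[] data;
    private int count = 0;

    ManualIntArray(int initialCapacity) {
        data = new int[Math.max(1, initialCapacity)];
    }

    void add(int value) {
        if (count == data.length) {
            // Grow by about 1.5x; the factor is an arbitrary choice for this sketch.
            int[] bigger = new int[data.length + (data.length >> 1) + 1];
            System.arraycopy(data, 0, bigger, 0, count);
            data = bigger;
        }
        data[count++] = value;
    }

    // Returns a copy that contains exactly the values added so far.
    int[] toArray() {
        int[] result = new int[count];
        System.arraycopy(data, 0, result, 0, count);
        return result;
    }
}
```

If a rough upper bound on the number of results is known, passing it as the initial capacity avoids most of the intermediate copies.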

As with any performance-related question, of course, test and make sure it really matters before spending too much time optimizing.