alibaba / DataX

DataX is the open-source version of Alibaba Cloud DataWorks Data Integration.

hdfsreader OOM #57

Open yuqingshui opened 6 years ago

yuqingshui commented 6 years ago

java.lang.OutOfMemoryError: Java heap space
    at com.csvreader.CsvReader.updateCurrentValue(Unknown Source) ~[javacsv-2.0.jar:na]
    at com.csvreader.CsvReader.checkDataLength(Unknown Source) ~[javacsv-2.0.jar:na]
    at com.csvreader.CsvReader.readRecord(Unknown Source) ~[javacsv-2.0.jar:na]
    at com.alibaba.datax.plugin.unstructuredstorage.reader.UnstructuredStorageReaderUtil.splitBufferedReader(UnstructuredStorageReaderUtil.java:72) ~[plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.plugin.unstructuredstorage.reader.UnstructuredStorageReaderUtil.doReadFromStream(UnstructuredStorageReaderUtil.java:288) ~[plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.plugin.unstructuredstorage.reader.UnstructuredStorageReaderUtil.readFromStream(UnstructuredStorageReaderUtil.java:214) ~[plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Task.startRead(HdfsReader.java:265) ~[hdfsreader-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.taskgroup.runner.ReaderRunner.run(ReaderRunner.java:57) ~[datax-core-0.0.1-SNAPSHOT.jar:na]
    at java.lang.Thread.run(Thread.java:748) [na:1.8.0_161]
Exception in thread "taskGroup-1" com.alibaba.datax.common.exception.DataXException: Code:[Framework-13], Description:[A DataX plugin failed at runtime; for the specific cause, see the error diagnostics printed when the DataX run finishes.].
 - java.lang.OutOfMemoryError: Java heap space
    at com.csvreader.CsvReader.updateCurrentValue(Unknown Source)
    at com.csvreader.CsvReader.checkDataLength(Unknown Source)
    at com.csvreader.CsvReader.readRecord(Unknown Source)
    at com.alibaba.datax.plugin.unstructuredstorage.reader.UnstructuredStorageReaderUtil.splitBufferedReader(UnstructuredStorageReaderUtil.java:72)
    at com.alibaba.datax.plugin.unstructuredstorage.reader.UnstructuredStorageReaderUtil.doReadFromStream(UnstructuredStorageReaderUtil.java:288)
    at com.alibaba.datax.plugin.unstructuredstorage.reader.UnstructuredStorageReaderUtil.readFromStream(UnstructuredStorageReaderUtil.java:214)
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Task.startRead(HdfsReader.java:265)
    at com.alibaba.datax.core.taskgroup.runner.ReaderRunner.run(ReaderRunner.java:57)
    at java.lang.Thread.run(Thread.java:748)

yuqingshui commented 6 years ago

Full thread dump Java HotSpot(TM) 64-Bit Server VM (25.161-b12 mixed mode):

"SIGINT handler" #49 daemon prio=9 os_prio=0 tid=0x00007ff384001000 nid=0xa7ff runnable [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"IPC Parameter Sending Thread #1" #48 daemon prio=5 os_prio=0 tid=0x00007ff2d4002000 nid=0xa7fe waiting on condition [0x00007ff3f19d8000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-1-3-writer" #45 prio=5 os_prio=0 tid=0x00007ff314010000 nid=0xa7f9 waiting on condition [0x00007ff3f02c5000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-0-15-reader" #44 prio=5 os_prio=0 tid=0x00007ff32001b000 nid=0xa7f8 waiting on condition [0x00007ff3f03c6000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-0-15-writer" #43 prio=5 os_prio=0 tid=0x00007ff320019800 nid=0xa7f7 runnable [0x00007ff3f04c7000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:171)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at oracle.net.ns.Packet.receive(Packet.java:300)
    at oracle.net.ns.DataPacket.receive(DataPacket.java:106)
    at oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:315)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:260)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:185)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:102)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:124)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:80)
    at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1137)
    at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:290)
    at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:192)
    at oracle.jdbc.driver.T4C8Oall.doOALL(T4C8Oall.java:531)
    at oracle.jdbc.driver.T4CPreparedStatement.doOall8(T4CPreparedStatement.java:207)
    at oracle.jdbc.driver.T4CPreparedStatement.executeForRows(T4CPreparedStatement.java:1044)
    at oracle.jdbc.driver.OraclePreparedStatement.executeForRowsWithTimeout(OraclePreparedStatement.java:10143)
    at oracle.jdbc.driver.OraclePreparedStatement.executeBatch(OraclePreparedStatement.java:10249)

"0-0-11-reader" #42 prio=5 os_prio=0 tid=0x00007ff320017800 nid=0xa7f6 waiting on condition [0x00007ff3f05c8000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-0-11-writer" #41 prio=5 os_prio=0 tid=0x00007ff320015800 nid=0xa7f5 runnable [0x00007ff3f06c9000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:171)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at oracle.net.ns.Packet.receive(Packet.java:300)
    at oracle.net.ns.DataPacket.receive(DataPacket.java:106)
    at oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:315)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:260)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:185)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:102)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:124)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:80)
    at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1137)
    at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:290)
    at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:192)
    at oracle.jdbc.driver.T4C8Oall.doOALL(T4C8Oall.java:531)
    at oracle.jdbc.driver.T4CPreparedStatement.doOall8(T4CPreparedStatement.java:207)
    at oracle.jdbc.driver.T4CPreparedStatement.executeForRows(T4CPreparedStatement.java:1044)
    at oracle.jdbc.driver.OraclePreparedStatement.executeForRowsWithTimeout(OraclePreparedStatement.java:10143)
    at oracle.jdbc.driver.OraclePreparedStatement.executeBatch(OraclePreparedStatement.java:10249)

"0-1-6-reader" #40 prio=5 os_prio=0 tid=0x00007ff31400e000 nid=0xa7f4 waiting on condition [0x00007ff3f07ca000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-1-6-writer" #39 prio=5 os_prio=0 tid=0x00007ff31400c000 nid=0xa7f3 runnable [0x00007ff3f08cb000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:171)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at oracle.net.ns.Packet.receive(Packet.java:300)
    at oracle.net.ns.DataPacket.receive(DataPacket.java:106)
    at oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:315)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:260)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:185)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:102)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:124)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:80)
    at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1137)
    at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:290)
    at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:192)
    at oracle.jdbc.driver.T4C7Ocommoncall.doOCOMMIT(T4C7Ocommoncall.java:75)
    at oracle.jdbc.driver.T4CConnection.doCommit(T4CConnection.java:610)

"0-0-19-reader" #38 prio=5 os_prio=0 tid=0x00007ff320014000 nid=0xa7f2 waiting on condition [0x00007ff3f09cc000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-0-19-writer" #36 prio=5 os_prio=0 tid=0x00007ff320012000 nid=0xa7f1 runnable [0x00007ff3f0acc000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:171)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at oracle.net.ns.Packet.receive(Packet.java:300)
    at oracle.net.ns.DataPacket.receive(DataPacket.java:106)
    at oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:315)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:260)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:185)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:102)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:124)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:80)
    at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1137)
    at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:290)
    at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:192)
    at oracle.jdbc.driver.T4C8Oall.doOALL(T4C8Oall.java:531)
    at oracle.jdbc.driver.T4CPreparedStatement.doOall8(T4CPreparedStatement.java:207)
    at oracle.jdbc.driver.T4CPreparedStatement.executeForRows(T4CPreparedStatement.java:1044)
    at oracle.jdbc.driver.OraclePreparedStatement.executeForRowsWithTimeout(OraclePreparedStatement.java:10143)
    at oracle.jdbc.driver.OraclePreparedStatement.executeBatch(OraclePreparedStatement.java:10249)

"0-1-4-reader" #37 prio=5 os_prio=0 tid=0x00007ff31400a000 nid=0xa7f0 waiting on condition [0x00007ff3f0bce000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-1-4-writer" #35 prio=5 os_prio=0 tid=0x00007ff314008800 nid=0xa7ef runnable [0x00007ff3f0ccf000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:171)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at oracle.net.ns.Packet.receive(Packet.java:300)
    at oracle.net.ns.DataPacket.receive(DataPacket.java:106)
    at oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:315)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:260)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:185)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:102)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:124)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:80)
    at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1137)
    at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:290)
    at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:192)
    at oracle.jdbc.driver.T4C8Oall.doOALL(T4C8Oall.java:531)
    at oracle.jdbc.driver.T4CPreparedStatement.doOall8(T4CPreparedStatement.java:207)
    at oracle.jdbc.driver.T4CPreparedStatement.executeForRows(T4CPreparedStatement.java:1044)
    at oracle.jdbc.driver.OraclePreparedStatement.executeForRowsWithTimeout(OraclePreparedStatement.java:10143)
    at oracle.jdbc.driver.OraclePreparedStatement.executeBatch(OraclePreparedStatement.java:10249)

"0-0-21-reader" #34 prio=5 os_prio=0 tid=0x00007ff320010800 nid=0xa7ee waiting on condition [0x00007ff3f0dd0000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-1-12-reader" #33 prio=5 os_prio=0 tid=0x00007ff314006800 nid=0xa7ed waiting on condition [0x00007ff3f0ed1000]
   java.lang.Thread.State: TIMED_WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)

"0-0-21-writer" #32 prio=5 os_prio=0 tid=0x00007ff320003000 nid=0xa7ec runnable [0x00007ff3f0fd2000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:171)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at oracle.net.ns.Packet.receive(Packet.java:300)
    at oracle.net.ns.DataPacket.receive(DataPacket.java:106)
    at oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:315)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:260)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:185)
    at oracle.net.ns.NetInputStream.read(NetInputStream.java:102)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:124)
    at oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:80)
    at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1137)
    at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:290)
    at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:192)
    at oracle.jdbc.driver.T4C8Oall.doOALL(T4C8Oall.java:531)
    at oracle.jdbc.driver.T4CPreparedStatement.doOall8(T4CPreparedStatement.java:207)
    at oracle.jdbc.driver.T4CPreparedStatement.executeForRows(T4CPreparedStatement.java:1044)
    at oracle.jdbc.driver.OraclePreparedStatement.executeForRowsWithTimeout(OraclePreparedStatement.java:10143)
    at oracle.jdbc.driver.OraclePreparedStatement.executeBatch(OraclePreparedStatement.java:10249)

"0-1-12-writer" #31 prio=5 os_prio=0 tid=0x00007ff314004800 nid=0xa7eb runnable [0x00007ff3f10d3000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketOutputStream.socketWrite0(Native Method)
    at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:111)
    at java.net.SocketOutputStream.write(SocketOutputStream.java:155)
    at oracle.net.ns.DataPacket.send(DataPacket.java:210)

"taskGroup-0" #29 prio=5 os_prio=0 tid=0x00007ff4452f8000 nid=0xa7e9 waiting on condition [0x00007ff3f12d5000]
   java.lang.Thread.State: TIMED_WAITING (sleeping)
    at java.lang.Thread.sleep(Native Method)
    at com.alibaba.datax.core.taskgroup.TaskGroupContainer.start(TaskGroupContainer.java:268)
    at com.alibaba.datax.core.taskgroup.runner.TaskGroupContainerRunner.run(TaskGroupContainerRunner.java:24)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)

"org.apache.hadoop.hdfs.PeerCache@672f11c2" #28 daemon prio=5 os_prio=0 tid=0x00007ff445517000 nid=0xa7e8 waiting on condition [0x00007ff3f15d6000]
   java.lang.Thread.State: TIMED_WAITING (sleeping)
    at java.lang.Thread.sleep(Native Method)
    at org.apache.hadoop.hdfs.PeerCache.run(PeerCache.java:255)
    at org.apache.hadoop.hdfs.PeerCache.access$000(PeerCache.java:46)
    at org.apache.hadoop.hdfs.PeerCache$1.run(PeerCache.java:124)
    at java.lang.Thread.run(Thread.java:748)

"client DomainSocketWatcher" #25 daemon prio=5 os_prio=0 tid=0x00007ff44524e800 nid=0xa7e5 runnable [0x00007ff3f39ed000]
   java.lang.Thread.State: RUNNABLE
    at org.apache.hadoop.net.unix.DomainSocketWatcher.doPoll0(Native Method)
    at org.apache.hadoop.net.unix.DomainSocketWatcher.access$900(DomainSocketWatcher.java:52)
    at org.apache.hadoop.net.unix.DomainSocketWatcher$2.run(DomainSocketWatcher.java:511)
    at java.lang.Thread.run(Thread.java:748)

"Service Thread" #20 daemon prio=9 os_prio=0 tid=0x00007ff444114000 nid=0xa7e0 runnable [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C1 CompilerThread14" #19 daemon prio=9 os_prio=0 tid=0x00007ff444108800 nid=0xa7df runnable [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C1 CompilerThread13" #18 daemon prio=9 os_prio=0 tid=0x00007ff444106800 nid=0xa7de waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C1 CompilerThread12" #17 daemon prio=9 os_prio=0 tid=0x00007ff444104800 nid=0xa7dd waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C1 CompilerThread11" #16 daemon prio=9 os_prio=0 tid=0x00007ff444102000 nid=0xa7dc waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C1 CompilerThread10" #15 daemon prio=9 os_prio=0 tid=0x00007ff444100000 nid=0xa7db waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread9" #14 daemon prio=9 os_prio=0 tid=0x00007ff4440fe000 nid=0xa7da runnable [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread8" #13 daemon prio=9 os_prio=0 tid=0x00007ff4440fc000 nid=0xa7d9 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread7" #12 daemon prio=9 os_prio=0 tid=0x00007ff4440f9800 nid=0xa7d8 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread6" #11 daemon prio=9 os_prio=0 tid=0x00007ff4440f7800 nid=0xa7d7 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread5" #10 daemon prio=9 os_prio=0 tid=0x00007ff4440f5000 nid=0xa7d6 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread4" #9 daemon prio=9 os_prio=0 tid=0x00007ff4440eb000 nid=0xa7d5 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread3" #8 daemon prio=9 os_prio=0 tid=0x00007ff4440e8800 nid=0xa7d4 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread2" #7 daemon prio=9 os_prio=0 tid=0x00007ff4440e6800 nid=0xa7d3 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread1" #6 daemon prio=9 os_prio=0 tid=0x00007ff4440e4800 nid=0xa7d2 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread0" #5 daemon prio=9 os_prio=0 tid=0x00007ff4440e1800 nid=0xa7d1 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"Signal Dispatcher" #4 daemon prio=9 os_prio=0 tid=0x00007ff4440e0000 nid=0xa7d0 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"Finalizer" #3 daemon prio=8 os_prio=0 tid=0x00007ff4440ad000 nid=0xa7cf in Object.wait() [0x00007ff3fafee000]
   java.lang.Thread.State: WAITING (on object monitor)
    at java.lang.Object.wait(Native Method)
    at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:143)

"Reference Handler" #2 daemon prio=10 os_prio=0 tid=0x00007ff4440a8800 nid=0xa7ce in Object.wait() [0x00007ff3fb0ef000]
   java.lang.Thread.State: WAITING (on object monitor)
    at java.lang.Object.wait(Native Method)
    at java.lang.Object.wait(Object.java:502)
    at java.lang.ref.Reference.tryHandlePending(Reference.java:191)

"job-0" #1 prio=5 os_prio=0 tid=0x00007ff44400a800 nid=0xa7b5 waiting on condition [0x00007ff44b240000]
   java.lang.Thread.State: TIMED_WAITING (sleeping)
    at java.lang.Thread.sleep(Native Method)
    at com.alibaba.datax.core.job.scheduler.AbstractScheduler.schedule(AbstractScheduler.java:102)
    at com.alibaba.datax.core.job.JobContainer.schedule(JobContainer.java:535)
    at com.alibaba.datax.core.job.JobContainer.start(JobContainer.java:119)
    at com.alibaba.datax.core.Engine.start(Engine.java:92)
    at com.alibaba.datax.core.Engine.entry(Engine.java:171)
    at com.alibaba.datax.core.Engine.main(Engine.java:204)

"VM Thread" os_prio=0 tid=0x00007ff4440a0800 nid=0xa7cd runnable

"GC task thread#0 (ParallelGC)" os_prio=0 tid=0x00007ff44401f800 nid=0xa7b6 runnable

"GC task thread#1 (ParallelGC)" os_prio=0 tid=0x00007ff444021800 nid=0xa7b7 runnable

"GC task thread#2 (ParallelGC)" os_prio=0 tid=0x00007ff444023800 nid=0xa7b8 runnable

"GC task thread#3 (ParallelGC)" os_prio=0 tid=0x00007ff444025000 nid=0xa7b9 runnable

"GC task thread#4 (ParallelGC)" os_prio=0 tid=0x00007ff444027000 nid=0xa7ba runnable

"GC task thread#5 (ParallelGC)" os_prio=0 tid=0x00007ff444029000 nid=0xa7bb runnable

"GC task thread#6 (ParallelGC)" os_prio=0 tid=0x00007ff44402a800 nid=0xa7bc runnable

"GC task thread#7 (ParallelGC)" os_prio=0 tid=0x00007ff44402c800 nid=0xa7bd runnable

"GC task thread#8 (ParallelGC)" os_prio=0 tid=0x00007ff44402e800 nid=0xa7be runnable

"GC task thread#9 (ParallelGC)" os_prio=0 tid=0x00007ff444030000 nid=0xa7bf runnable

"GC task thread#10 (ParallelGC)" os_prio=0 tid=0x00007ff444032000 nid=0xa7c0 runnable

"GC task thread#11 (ParallelGC)" os_prio=0 tid=0x00007ff444033800 nid=0xa7c1 runnable

"GC task thread#12 (ParallelGC)" os_prio=0 tid=0x00007ff444035800 nid=0xa7c2 runnable

"GC task thread#13 (ParallelGC)" os_prio=0 tid=0x00007ff444037800 nid=0xa7c3 runnable

"GC task thread#14 (ParallelGC)" os_prio=0 tid=0x00007ff444039000 nid=0xa7c4 runnable

"GC task thread#15 (ParallelGC)" os_prio=0 tid=0x00007ff44403b000 nid=0xa7c5 runnable

"GC task thread#16 (ParallelGC)" os_prio=0 tid=0x00007ff44403d000 nid=0xa7c6 runnable

"GC task thread#17 (ParallelGC)" os_prio=0 tid=0x00007ff44403e800 nid=0xa7c7 runnable

"GC task thread#18 (ParallelGC)" os_prio=0 tid=0x00007ff444040800 nid=0xa7c8 runnable

"GC task thread#19 (ParallelGC)" os_prio=0 tid=0x00007ff444042800 nid=0xa7c9 runnable

"GC task thread#20 (ParallelGC)" os_prio=0 tid=0x00007ff444044000 nid=0xa7ca runnable

"GC task thread#21 (ParallelGC)" os_prio=0 tid=0x00007ff444046000 nid=0xa7cb runnable

"GC task thread#22 (ParallelGC)" os_prio=0 tid=0x00007ff444048000 nid=0xa7cc runnable

"VM Periodic Task Thread" os_prio=0 tid=0x00007ff444118800 nid=0xa7e1 waiting on condition

JNI global references: 265

Heap
 PSYoungGen      total 3184128K, used 2441324K [0x00000006eab00000, 0x00000007bd900000, 0x00000007c0000000)
  eden space 2921472K, 78% used [0x00000006eab00000,0x00000007774ae9b0,0x000000079d000000)
  from space 262656K, 52% used [0x00000007ad880000,0x00000007b5eec980,0x00000007bd900000)
  to   space 266752K, 0% used [0x000000079d000000,0x000000079d000000,0x00000007ad480000)
 ParOldGen       total 6990848K, used 4542655K [0x0000000540000000, 0x00000006eab00000, 0x00000006eab00000)
  object space 6990848K, 64% used [0x0000000540000000,0x000000065542fcc0,0x00000006eab00000)
 Metaspace       used 31230K, capacity 31474K, committed 31872K, reserved 1077248K
  class space    used 3527K, capacity 3648K, committed 3712K, reserved 1048576K

yuqingshui commented 6 years ago

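Problem solved. By default the CSV reader treats the double quote as a text qualifier (useTextQualifier is true), and some of the fields in my data contain that delimiter character. As a result a single StringColumn grew far too large and the heap overflowed. The fix: set the file type to csv, then specify "csvReaderConfig":{ "useTextQualifier": false }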
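
For readers who want to see where that setting goes, here is a minimal sketch of a job file with the fix applied. It is not the original poster's configuration, which was never shared in this thread. The writer is assumed to be oraclewriter only because the thread dump above shows Oracle JDBC frames. Every path, host name, credential, column, table name, and the channel count is a made-up placeholder to be replaced with your own values. The only part taken from the comment above is "fileType": "csv" together with the csvReaderConfig entry.

```json
{
  "job": {
    "setting": {
      "speed": { "channel": 2 }
    },
    "content": [
      {
        "reader": {
          "name": "hdfsreader",
          "parameter": {
            "defaultFS": "hdfs://example-namenode:8020",
            "path": "/example/path/to/data/*",
            "fileType": "csv",
            "encoding": "UTF-8",
            "fieldDelimiter": ",",
            "column": [
              { "index": 0, "type": "string" },
              { "index": 1, "type": "string" }
            ],
            "csvReaderConfig": {
              "useTextQualifier": false
            }
          }
        },
        "writer": {
          "name": "oraclewriter",
          "parameter": {
            "username": "example_user",
            "password": "example_password",
            "column": ["EXAMPLE_COL_A", "EXAMPLE_COL_B"],
            "connection": [
              {
                "jdbcUrl": "jdbc:oracle:thin:@example-db-host:1521:EXAMPLEDB",
                "table": ["EXAMPLE_TABLE"]
              }
            ]
          }
        }
      }
    ]
  }
}
```

With useTextQualifier set to false, a stray double quote inside a field should no longer be read as the start of a quoted value that runs on across many lines. The trade-off is that genuinely quoted fields containing the field delimiter will then be split, so this only suits data that does not rely on quoting.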

i-love-doufunao commented 5 years ago

Could you share your job configuration for reference?