Closed by mxdlzg 3 weeks ago
input_ids: [[[128000,128006,9125,128007,271,...,15837,13,128009,128006,882],[128007,271,15546,2643,387,...,2686,15837,304,23719,13],...,[3674,32305,13,1102,374,...,3041,499,19737,389,279],[1989,719,279,3828,374,...,8774,53302,323,3339,709]]]
labels: [[[-100,-100,-100,-100,-100,...,15837,13,128009,-100,-100],[-100,-100,-100,-100,-100,...,-100,-100,-100,-100,-100],...,[3674,32305,13,1102,374,...,-100,-100,-100,-100,-100],[-100,-100,-100,-100,-100,...,-100,-100,-100,-100,-100]]]
length: [[4096,4096,4096,4096,4096,...,4096,4096,4096,4096,4096]]], [MemoryMappedTable
input_ids: list
These are logs I added for debugging; they don't normally appear.
Figured it out: it was caused by different versions of the datasets library. Upgrading everything to 2.20.0 fixed it.
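For anyone hitting the same thing, here is a minimal sketch of how the installed version could be checked on each machine before launching. It only uses the standard library; the helper names `installed_version` and `find_mismatches`, the host names, and the `2.19.1` value in the usage note are made up for illustration and are not from this thread.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(versions_by_host, expected):
    """Return {host: version} for every host whose reported version differs from `expected`."""
    return {host: ver for host, ver in versions_by_host.items() if ver != expected}


if __name__ == "__main__":
    # Run this on every node and compare the printed values by hand,
    # or collect them into a dict and pass it to find_mismatches().
    print("datasets", installed_version("datasets"))
```

Usage, with hypothetical values: `find_mismatches({"master": "2.20.0", "worker": "2.19.1"}, "2.20.0")` returns only the `worker` entry, which is the node that would need upgrading.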
Different machines on the same network segment.
This is the startup output from the master node:
The worker node fails: