tony-framework
/
TonY
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
https://tony-project.ai
Other
708
stars
164
forks
issues
feat: support placement constraint in app level
#682
zuston
closed
1 year ago
1
Support placement constraint
#681
zuston
closed
1 year ago
4
improvement(client): throw the exception when submitting to Yarn
#680
zuston
closed
1 year ago
1
Support placement constraint
#679
zuston
closed
1 year ago
0
[FIX] Ignore non-existing role to avoid NPE under group-dependency-timeout mechanism
#678
zuston
closed
1 year ago
2
Update version to 0.5.4
#677
zuston
closed
1 year ago
1
Add sleep to free up cpu
#676
zuston
closed
1 year ago
0
tony-core runtime error
#675
tonywang-sh
opened
2 years ago
14
Check registrationTimeoutMs in advance to avoid decrease in efficiency
#674
daugraph
closed
2 years ago
3
ERROR ApplicationMaster:496 - Exception while preparing AM org.apache.hadoop.yarn.exceptions.YarnException: Can't resolve the ip of ubuntu at com.linkedin.tony.util.Utils.getHostNameOrIpFromTokenConf(Utils.java:365) at com.linkedin.tony.ApplicationMaster.prepare(ApplicationMaster.java:476) at com.linkedin.tony.ApplicationMaster.run(ApplicationMaster.java:368) at com.linkedin.tony.ApplicationMaster.main(ApplicationMaster.java:342)
#673
ckqqqq
opened
2 years ago
2
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FSDataOutputStream
#672
Alpaca4610
closed
2 years ago
1
Update version to 0.5.3
#671
zuston
closed
2 years ago
0
Failed to get RM principal
#670
Alpaca4610
closed
2 years ago
2
AM Web: Correct the tips in tony client
#669
zuston
closed
2 years ago
1
AM Web UI: Reduce logo size and add link
#668
zuston
closed
2 years ago
0
Optimize the TonY AM web dashboard page
#667
zuston
closed
2 years ago
1
Optimize the TonY AM web dashboard page
#666
zuston
closed
2 years ago
2
Fix ConcurrentModificationException when we traverse registeredTasks #664
#665
oliverhu
closed
2 years ago
0
ConcurrentModificationException when we traverse registeredTasks
#664
oliverhu
closed
2 years ago
1
Fix the missing cluster_spec due to the null when getting clusterSpec…
#663
zuston
closed
2 years ago
0
Update version to 0.5.1
#662
zuston
closed
2 years ago
0
Introduce the simple TonY web dashboard
#661
zuston
closed
2 years ago
3
Provide detailed diagnostic message when allocation timeout
#660
zuston
closed
2 years ago
0
Introduce the simple TonY web dashboard
#659
zuston
closed
2 years ago
1
Support venv of tar.gz compression algorithm
#658
zuston
opened
2 years ago
0
Instability test case of testTonyAllocationTimeoutShouldFail
#657
zuston
closed
2 years ago
0
Fix the instability of testTonyAMStartupTimeoutShouldFail
#656
zuston
closed
2 years ago
0
Release TonY v0.5.0
#655
zuston
closed
2 years ago
0
Disable refreshMemoryBytesMetrics when running test cases on Mac
#654
zuston
closed
2 years ago
0
Collect the detailed execution error log to Yarn diagnostics
#653
zuston
closed
2 years ago
1
Separate the python subprocess log from task executor log
#652
zuston
closed
2 years ago
0
Get task executor's python subprocess exit detailed diagnostics message
#651
zuston
closed
2 years ago
0
Attach more readable info to task executor's thread
#650
zuston
closed
2 years ago
0
Add shutdown hook when waiting executor's subprocess result
#649
zuston
closed
2 years ago
0
Introduce the config of max-waiting-time when task executor register to AM
#648
zuston
closed
2 years ago
1
Remove the duplicate logic code
#647
zuston
closed
2 years ago
0
Separate the interface of registerTask and getClusterSpec in TaskExec…
#646
zuston
closed
2 years ago
0
[Optimization] Introducing the config of timeout that task executor register to AM
#645
zuston
closed
2 years ago
0
[Optimization] Separate the interface of registerTask and getClusterSpec in TaskExecutor
#644
zuston
closed
2 years ago
0
Release TonY v0.4.15
#643
zuston
closed
2 years ago
0
Configurable status when dependency times out
#642
zuston
closed
2 years ago
0
Configurable status when dependency times out
#641
oliverhu
closed
2 years ago
2
Release TonY v0.4.14
#640
zuston
closed
2 years ago
1
Killing all applications when test case stopped in testTonyAMStartupT…
#639
zuston
closed
2 years ago
0
The process of task executor is still alive when existing NM marked as lost node by RM
#638
zuston
closed
2 years ago
0
The process of task executor is still alive when existing NM marked as lost node by RM
#637
zuston
closed
2 years ago
0
Allow that one role of task executor could make other roles exit
#636
zuston
opened
2 years ago
3
Make job fail fast when container starting failed
#635
zuston
closed
2 years ago
3
CI looks unstable
#634
zuston
closed
2 years ago
1
Add timeout in testTonyAMStartupTimeoutShouldFail
#633
zuston
closed
2 years ago
0