Open 1032851561 opened 1 week ago
Is there any error log in master? or error command in t_ds_error_command?
You can get the scheduler count metrics by ds_master_quartz_job_executed
Is there any error log in master? or error command in t_ds_error_command? You can get the scheduler count metrics by
ds_master_quartz_job_executed
- no error log in master and api service
- two records in t_ds_error_command, I had deleted, but not work.
- not found metrics of
ds_master_quartz_job_executed
, just foundds_master_consume_command_count_total{application="master-server",} 0.0
ProcessScheduleTask#executeInternal
is not running yet. Is it running in dolphinscheduler-api server?- after many cycles have passed, there is not new record generated in table 't_ds_process_instance' .
ProcessScheduleTask#executeInternal
is running on master. You need to provide more information, e.g. your cluster information, is this bug can reproduce?
ProcessScheduleTask#executeInternal
is running on master. You need to provide more information, e.g. your cluster information, is this bug can reproduce?
The bug is alway exist. All timed job not trigger. My cluster: docker deployment , 1 master ,1 worker ,1 apiserver , postgresql database
The process goes like this:
I can't see the log of ProcessScheduleTask in master: scheduled fire time :{}, fire time......
, so is quartz something wrong?
ProcessScheduleTask#executeInternal
is running on master. You need to provide more information, e.g. your cluster information, is this bug can reproduce?
I try to debug master:
run the sql directly:
4. found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
- found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
https://github.com/apache/dolphinscheduler/issues/16197#issuecomment-2184493186
- found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
If you delete the records from t_ds_error_command, then you cannot find out the reason why the command handle failed. I am not clear why you delete these, these will not affect the system.
- found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
If you delete the records from t_ds_error_command, then you cannot find out the reason why the command handle failed. I am not clear why you delete these, these will not affect the system.
My problem is not why the command handler failed . Instead, ProcessScheduleTask
why doesn't execute, this is a quratz job ,it not trigger.
please see this : https://github.com/apache/dolphinscheduler/issues/16197#issuecomment-2184493186
- found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
If you delete the records from t_ds_error_command, then you cannot find out the reason why the command handle failed. I am not clear why you delete these, these will not affect the system.
My problem is not why the command handler failed . Instead,
ProcessScheduleTask
why doesn't execute, this is a quratz job ,it not trigger.please see this : #16197 (comment)
I'm still not sure what your problem is at the moment, right now ds process timing task will have two steps:
You means the step one is wrong? There are many reason may cause the step one not execute. e.g. quartz metadata is incorrect, quartz main thread is block, db lock. You can find some detail from the log and check if there exist dead lock in db.
- found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
If you delete the records from t_ds_error_command, then you cannot find out the reason why the command handle failed. I am not clear why you delete these, these will not affect the system.
My problem is not why the command handler failed . Instead,
ProcessScheduleTask
why doesn't execute, this is a quratz job ,it not trigger. please see this : #16197 (comment)I'm still not sure what your problem is at the moment, right now ds process timing task will have two steps:
- Generate command by quartz task
- Execute the command.
You means the step one is wrong? There are many reason may cause the step one not execute. e.g. quartz metadata is incorrect, quartz main thread is block, db lock. You can find some detail from the log and check if there exist dead lock in db.
Yes, step one is wrong , it is never tigger. Quartz main thread is running , it query the table qrtz_triggers
to find some timed job has triggered. When I debug the master service remotely, the code shows 0 records, but running the sql directly in the database shows 3 records.
- found error in master , some log like 'Master handle command xxx error '
This is caused by master handle command failed, you can find the reason from t_ds_error_command or master error log
If you delete the records from t_ds_error_command, then you cannot find out the reason why the command handle failed. I am not clear why you delete these, these will not affect the system.
My problem is not why the command handler failed . Instead,
ProcessScheduleTask
why doesn't execute, this is a quratz job ,it not trigger. please see this : #16197 (comment)I'm still not sure what your problem is at the moment, right now ds process timing task will have two steps:
- Generate command by quartz task
- Execute the command.
You means the step one is wrong? There are many reason may cause the step one not execute. e.g. quartz metadata is incorrect, quartz main thread is block, db lock. You can find some detail from the log and check if there exist dead lock in db.
Yes, step one is wrong , it is never tigger. Quartz main thread is running , it query the table
qrtz_triggers
to find some timed job has triggered. When I debug the master service remotely, the code shows 0 records, but running the sql directly in the database shows 3 records.
Is the date is correct of the master machine?
Search before asking
What happened
My timed task add success but never trigger
What you expected to happen
The task should be triggered every minute.
How to reproduce
Just create a 'shell' task , print some message , online this timed task.
Anything else
No response
Version
3.2.x
Are you willing to submit PR?
Code of Conduct