Closed yinlinzh closed 7 years ago
1) What sort of integration are you looking for?
2) A few people have used Bistro as a work-queue in the past.
Are you trying to queue many sharded jobs, or to queue per-shard tasks for a single job? Can you give more detail?
The typical pattern for a queue of many jobs would be:
ConfigLoader
for that DB type supporting saveJob
and deleteJob
. There are some simple examples https://github.com/facebook/bistro/tree/master/bistro/configThe typical pattern for a queue of many shards / tasks would be similar:
ls /some/path
is a valid script_fetcher
.NodeFetcher
, or use ScriptFetcher
to have your scheduler run an external script to fetch the nodes. Code here: https://github.com/facebook/bistro/tree/master/bistro/nodes3) I don't really understand your question. Can you try describing what you are trying to do, and how you want to do it in more detail? I assume you are aware of the binaries bistro_scheduler
and bistro_worker
, and how they interact? If not, you should try --help
with both, and skim the protocol documentation:
@yinlinzh, please reopen if you would like to resume this discussion :)
Hi Developers,
I'm working on setting up distributed task scheduling system to achieve data migration from HDFS to local file system. The example use case is querying the namenode to get file list, then create and schedule the task for each individual file. In README, the example code shows that scheduling and executing one task which written in script. Here are some of my questions:
Thanks in advance!