Closed quyen closed 11 years ago
We are using a redis
queue with circus
to lauch X workers. And it works fine.
Hey @quyen can you be a little more specific about what errors you're encountering?
DocumentCloud uses Docsplit in a manner similar to what @Natim outlines.
@knowtheory & @Natim - I'm trying to do the same thing as Quyen, but having some trouble figuring out Circus..
Would you guys happen to know of any tutorial covering the set-up for using Circus to run multiple processes?
Thanks in advance.
@antonlakin Have a look at thoose projects : https://github.com/novagile/insight-reloaded and https://github.com/novagile/insight-installer there is some configuration example : https://github.com/novagile/insight-installer/tree/master/chef/cookbooks/insight/templates/default
Just for some additional details, DocumentCloud uses CloudCrowd for distributed queuing of jobs which use DocSplit. You can check out the actions we've written, and in particular note the document_import action.
I'd like to be able to extract pdf concurently, but it is not possible with docsplit gem I tried to extract 2 ppt files to pdf, the gem fails to process. The code is as below, please replace path_to_docsplit.rb, path_to_test_file1.ppt, path_to_test_file2.ppt
Im looking forward to your answer. Thank you, Quyen
!/usr/bin/ruby
require 'path_to_docsplit.rb'
def extraction(path_to_file)
Docsplit.extract_pdf(path_to_file) end
puts('start extraction') t1=Thread.new{extraction('path_to_test_file1.ppt')} t2=Thread.new{extraction('path_to_test_file2.ppt')} t1.join t2.join puts('end extraction')