wip distributed backup / file mirroring tool
Mirror your files between all your computers and servers, so when a harddrive breaks or a computer gets stolen you don't lose anything.
Like bittorrent sync, but open source and in node.js, so it runs on SmartOS too.
Like dropbox, but stores your data only on your machines. No third party - except possibly internet providers - involved.
The filesystem is represented as a merkle tree, so backer can efficiently figure out what files changed.
When two or more nodes start replicating (i.e. mirroring, syncing) they need to figure out how their files differ, so they can exchange only what they really have to.
The most efficient way to do this - as far as I know - is to create a merkle tree (also known as hash tree) representing all the files and directories in your sync folder.
Since the file system is a tree this is a perfect fit!
When replication is kicked off, both nodes exchange their top hash, the hash of the root node. If that differs, they start going down the tree (i.e. going deeper into directories) and exchange hashes until the know which files are different and which are the same.
Then the individual files are diffed and those partial changes are exchanged and applied.
TODO
Since backer is only used for transferring data and never stores it on any server/computer but the user's, only transfor needs to be encrypted.
Encrypting your own harddrive is out of scope and there's plenty of tools that do this, one popular being truecrypt.
For maximum composability the transport part of the api will just be a duplex stream that is to be piped to another backer's duplex stream. This way it works over tcp, websockets, in memory, or over any another network or streaming interface.
var a = backer(__dirname + '/a').createStream();
var b = backer(__dirname + '/b').createStream();
a.pipe(b).pipe(a);
Over tcp it would look like this:
// computer A
var backer = require('backer');
var back = backer(__dirname);
net.createServer(function(con) {
con.pipe(back.createStream()).pipe(con);
}).listen(PORT);
// computer B
var backer = require('backer');
var reconnect = require('reconnect');
var back = backer(__dirname);
reconnect(function(con) {
con.pipe(back.createStream()).pipe(con);
}).listen(PORT);
There will be events emitted on the backer instance, which can then for example be fed to a web frontend.
There's a lot still to be figured out, so if you either are a mad scientist that juggles chainsaws while writing distributed systems, or want to be come one, this is the right place for you!
Plus, there's going to be need for designs, blog posts, a public facing website, etc.!
MIT