To index files on our internal share, we could mount the share read-only and plug the mounted directory into Nutch through a custom Protocol that takes the afp:// URLs and translates them into file:// URLs. This should allow Nutch to send the proper URLs to Elasticsearch without our having to implement afp:// in Java.
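As a rough illustration of the translation step only, here is a minimal Java sketch. It is not a Nutch plugin and does not use any Nutch API; the class name `AfpUrlMapper`, the share prefix `afp://fileserver.example/Shared`, and the mount point `/mnt/share` are all hypothetical placeholders. It assumes the whole share is mounted at a single local directory, so the mapping is a plain prefix swap in both directions.

```java
/**
 * Hypothetical helper: maps afp:// URLs under one share to file:// URLs
 * under a local (read-only) mount point, and back again.
 */
final class AfpUrlMapper {
    private final String afpPrefix;   // assumed share URL, e.g. "afp://fileserver.example/Shared"
    private final String filePrefix;  // "file://" + assumed mount point, e.g. "file:///mnt/share"

    AfpUrlMapper(String afpShareUrl, String mountPoint) {
        this.afpPrefix = stripTrailingSlash(afpShareUrl);
        this.filePrefix = "file://" + stripTrailingSlash(mountPoint);
    }

    private static String stripTrailingSlash(String s) {
        return s.endsWith("/") ? s.substring(0, s.length() - 1) : s;
    }

    /** Swaps one prefix for another, rejecting URLs outside the expected prefix. */
    private static String swapPrefix(String url, String from, String to) {
        boolean inside = url.equals(from) || url.startsWith(from + "/");
        if (!inside) {
            throw new IllegalArgumentException("URL is not under " + from + ": " + url);
        }
        // The remainder (including any percent-encoding) is copied unchanged.
        return to + url.substring(from.length());
    }

    /** afp://host/share/path -> file:///mount/path, used when fetching. */
    String toFileUrl(String afpUrl) {
        return swapPrefix(afpUrl, afpPrefix, filePrefix);
    }

    /** file:///mount/path -> afp://host/share/path, used when reporting the URL to the index. */
    String toAfpUrl(String fileUrl) {
        return swapPrefix(fileUrl, filePrefix, afpPrefix);
    }
}
```

With these placeholder values, `new AfpUrlMapper("afp://fileserver.example/Shared", "/mnt/share").toFileUrl("afp://fileserver.example/Shared/docs/report.pdf")` yields `file:///mnt/share/docs/report.pdf`, and `toAfpUrl` reverses it. A custom Protocol could call something like this before delegating to the existing file handling, while keeping the afp:// form as the URL that gets indexed.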