Closed klockla closed 1 week ago
Thanks @klockla Looks good at this stage but I think it needs an addition to the client so that we can query the new endpoint and display the status of a URL.
see comment in the conversation re-client side
Added the method in client.
thanks a lot @klockla - I gave it a try and it seems to work fine let me know what you think of my comments and suggestions above
Tested, works great! Thanks @klockla, this is a great contribution to the project
Add a new API method to retrieve information about an URL
Implemented only for MemoryFrontier and RocksDb (may fullfill partially https://github.com/crawler-commons/url-frontier/issues/57 )
Unfortunately the internal storage doesn't make a distinction between Discovered and Known URLs which have to be refetched (or I have missed the point)
So all scheduled items will be returned as a KnownURLItem (with a refetch date equal to 0 for completed items) If the URL is not in URLFrontier, the method will return io.grpc.Status.NOT_FOUND.asRuntimeException()
Signed-off-by: Laurent Klock Laurent.Klock@arhs-cube.com