Closed vokac closed 1 year ago
Hi Petr,
Just for kicks (not as a real solution). What happens if you specify IPv6 to host mapping in /etc/hosts on the door node.
Thanks, Dmitry
No difference.
I guess Java also use normal NETLINK interface on linux, but Java returns interfaces/addresses in reverse order compared getifaddrs
example:
[root@se1.farm.particle.cz /tmp]# ./a.out
Internet Address: (null)
LineDescription : lo
Broadcast Addr : (null)
Internet Address: (null)
LineDescription : eth0
Broadcast Addr : (null)
Internet Address: (null)
LineDescription : eth1
Broadcast Addr : (null)
Internet Address: 127.0.0.1
LineDescription : lo
Netmask : 255.0.0.0
Broadcast Addr : 127.0.0.1
Internet Address: 147.231.25.100
LineDescription : eth0
Netmask : 255.255.255.0
Broadcast Addr : 147.231.25.255
Internet Address: 172.16.0.100
LineDescription : eth1
Netmask : 255.255.0.0
Broadcast Addr : 172.16.255.255
Internet Address: ::1
LineDescription : lo
Netmask : ffff:ffff:ffff:ffff:ffff:ffff:ffff:ffff
Internet Address: fe80::5054:ff:fe8e:c89a
LineDescription : eth0
Netmask : ffff:ffff:ffff:ffff::
Internet Address: 2001:718:401:6017:2::1000
LineDescription : eth1
Netmask : ffff:ffff:ffff:ffff:ffff:ffff:ffff:ffff
Internet Address: fe80::5054:ff:fef1:ef79
LineDescription : eth1
Netmask : ffff:ffff:ffff:ffff::
Another question - if checksum verification is not requested - does it work? (just narrowing the space).
Yes, transfer without checksum verification works. Actually file is transferred and also checksum works if it is executed as separate command, e.g.
$ xrdfs root://se1.farm.particle.cz:1094 rm /dteam/test.chksum
$ xrdcopy --cksum ADLER32:print /etc/services root://se1.farm.particle.cz:1094//dteam/test.chksum
[654.6kB/654.6kB][100%][==================================================][654.6kB/s]
Run: [ERROR] Invalid redirect URL: Got an error while querying the checksum! (destination)
$ xrdfs root://se1.farm.particle.cz:1094 query checksum /dteam/test.chksum
adler32 6408a0a8
Fortunately Rucio file upload doesn't ask for checksum during copy operation => two independent gfal2 operations (copy, checksum) => this issue doesn't affect our production workflows.
Hi Petr,
Might I ask you what version of the xrootd client you are using?
Thanks, Al
Albert L. Rossi Senior Software Developer Scientific Computing Division, Scientific Data Services, Distributed Data Development WH 566 Fermi National Accelerator Laboratory Batavia, IL 60510 (630) 840-3023
From: vokac @.> Sent: Monday, November 28, 2022 10:48 AM To: dCache/dcache @.> Cc: Subscribed @.***> Subject: Re: [dCache/dcache] LinkLocal addresses used by xroot checksum redirection (failing xroot uploads) (Issue #6884)
Yes, transfer without checksum verification works. Actually file is transferred and also checksum works if it is executed as separate command, e.g.
$ xrdfs root://se1.farm.particle.cz:1094 rm /dteam/test.chksum
$ xrdcopy --cksum ADLER32:print /etc/services root://se1.farm.particle.cz:1094//dteam/test.chksum [654.6kB/654.6kB][100%][==================================================][654.6kB/s] Run: [ERROR] Invalid redirect URL: Got an error while querying the checksum! (destination)
$ xrdfs root://se1.farm.particle.cz:1094 query checksum /dteam/test.chksum adler32 6408a0a8
Fortunately Rucio file upload doesn't ask for checksum during copy operation => two independent gfal2 operations (copy, checksum) => this issue doesn't affect our production workflows.
— Reply to this email directly, view it on GitHubhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_dCache_dcache_issues_6884-23issuecomment-2D1329419161&d=DwMCaQ&c=gRgGjJ3BkIsb5y6s49QqsA&r=60rQ0HHqHmEY1P6VSdyuTQ&m=ctCxuYjjwP2Ho6q68DmDJ46gOZxnZs0VxCsNWJ6hSjcahCF8Fg0p5tivnG9JBgWN&s=ivl2R6MQjaeV1mcDb2ta7iAkXEs9S8r_rnUfm2A7i_E&e=, or unsubscribehttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_notifications_unsubscribe-2Dauth_AA6NBHDEOY4MIF2K447AY6LWKTO5JANCNFSM6AAAAAASMTZCMI&d=DwMCaQ&c=gRgGjJ3BkIsb5y6s49QqsA&r=60rQ0HHqHmEY1P6VSdyuTQ&m=ctCxuYjjwP2Ho6q68DmDJ46gOZxnZs0VxCsNWJ6hSjcahCF8Fg0p5tivnG9JBgWN&s=aDrr8OyQ6JfduMhB141v0OmIOhe16XR6IxUcX6zyggs&e=. You are receiving this because you are subscribed to this thread.Message ID: @.***>
Latest version from EPEL7
$ rpm -qa xrootd-client
xrootd-client-5.5.1-1.el7.x86_64
Petr, I have a potential fix.
If I send you a link to the .jar file would you be able to deploy and test?
do:
rm -f /usr/share/dcache/classes//dcache-xrootd-*.jar
drop the new .jar into that directory
and restart the domain.
Thanks - Al
What version of dcache are you running?
Sure, I can try your update.
8.2.2 on the headnode (se1.farm.particle.cz
) and 8.2.0 on door/pool nodes
Sent you the link from my Google drive.
With your dcache-xrootd-8.2.6-SNAPSHOT.jar
xroot transfer with checksum works, details:
$ XRD_LOGLEVEL=Dump xrdcopy --cksum ADLER32:print /etc/services root://se1.farm.particle.cz:1094//dteam/test.chksum
[2022-11-28 23:00:56.011449 +0100][Debug ][Utility ] Initializing xrootd client version: v5.5.1
...
[2022-11-28 23:00:57.024942 +0100][Debug ][Utility ] Attempting checksum calculation, mode: target.
[2022-11-28 23:00:57.025007 +0100][Dump ][Utility ] URL: root://dpmpool21.farm.particle.cz:23398//dteam/test.chksum?org.dcache.uuid=790b66ef-ba1b-499d-a386-eacfd7768319&org.dcache.xrootd.client=vokac.23233@ui2.farm.particle.cz&oss.asize=670293&xrd.logintoken=org.dcache.door=se1.farm.particle.cz:1094&xrdcl.requuid=36048a63-fb7e-4b1c-aa7e-f55668c78782
...
[2022-11-28 23:00:57.035275 +0100][Dump ][XRootD ] [dpmpool21.farm.particle.cz:23398] Got kXR_redirect response to message kXR_query (code: kXR_Qcksum, arg length: 36): se1.farm.particle.cz, port 1094
...
[2022-11-28 23:00:57.044068 +0100][Dump ][FileSystem ] [0x948ea0@dpmpool21.farm.particle.cz:23398] Assigning dpmpool21.farm.particle.cz:23398 as last URL
[2022-11-28 23:00:57.044115 +0100][Debug ][XRootD ] Redirect trace-back:
[2022-11-28 23:00:57.044115 +0100][Debug ][XRootD ] 0. Redirected from: root://dpmpool21.farm.particle.cz:23398//dteam/test.chksum to: root://se1.farm.particle.cz:1094/
[2022-11-28 23:00:57.044130 +0100][Debug ][ExDbgMsg ] [se1.farm.particle.cz:1094] Destroying MsgHandler: 0x94bd80.
[2022-11-28 23:00:57.044153 +0100][Dump ][Utility ] Checksum for /dteam/test.chksum checksum: adler32:6408a0a8
[2022-11-28 23:00:57.044294 +0100][Debug ][File ] [0x947da0@file://localhost/etc/services?xrdcl.requuid=817c273c-1794-4645-baef-6dd14bcbbba4] Sending a close command for handle 0xe to localhost
[2022-11-28 23:00:57.044397 +0100][Debug ][File ] [0x947da0@file://localhost/etc/services?xrdcl.requuid=817c273c-1794-4645-baef-6dd14bcbbba4] Close returned from localhost with: [SUCCESS]
[2022-11-28 23:00:57.044422 +0100][Dump ][File ] [0x947da0@file://localhost/etc/services?xrdcl.requuid=817c273c-1794-4645-baef-6dd14bcbbba4] Items in the fly 0, queued for recovery 0
adler32: 6408a0a8 root://se1.farm.particle.cz:1094/dteam/test.chksum 670293
...
Great. Thanks Petr.
(I'll be looking at the other problem with xroots / ls soon).
Uploading file with checksum validation using xroot protocol fails with
Invalid redirect URL
, because dCache try to redirect client to the dCache headnode link local IPv6 address. Why dCache even consider local addresses to be used during transfers?It seems to me that dCache use first discovered interface address, but that's IPv6 link local in our case, e.g. with
I'm getting following output on
se1.farm.particle.cz