DIRACGrid / DIRAC

DIRAC Grid
http://diracgrid.org
GNU General Public License v3.0

Failure in getReplicas with unexpected keyword #498

Closed closier closed 12 years ago

closier commented 12 years ago

Using DIRAC v6r1p8, I get this error when trying to run the InputDataResolution module:

setJobStatus(896255,Running,Input Data Resolution,JobWrapper)
Input Data is: /lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw
Job input data requirement is /lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw
Job input data resolution policy module is LHCbDIRAC.Core.Utilities.InputDataResolution
Site has the following local SEs: CERN-FREEZER, CERN-RAW, CERN_M-DST, CERN-DST, CERN-USER, CERN-FAILOVER, CERN-disk, CERN-tape, CERN-RDST, CERN_MC_M-DST, CERN-HIST, CERN-ARCHIVE, CERN_MC-DST
Found optimizer catalogue result
{'OK': True, 'Value': {'Successful': {'/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw': {'CERN-RAW': 'srm://srm-lhcb.cern.ch/castor/cern.ch/grid/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw', 'SARA-RAW': 'srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw'}}, 'Failed': {}}}
{'OK': True, 'Value': {'Successful': {'/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw': {'CERN-RAW': 'srm://srm-lhcb.cern.ch/castor/cern.ch/grid/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw', 'SARA-RAW': 'srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw'}}, 'Failed': {}}}
The following file(s) were found not to have a GUID:
/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89335/089335_0000000054.raw
Could not obtain replica information from Optimizer File Catalog information
{'Message': 'File metadata is not available', 'OK': False}
Trying to load from root path LHCbDIRAC
Trying to load from root path DIRAC
Trying to load from root path LHCbDIRAC
Trying to load from root path LHCbDIRAC
JC res = fileCatalog.getReplicas(lfns, allStatus=False)
JobWrapper failed to resolve input data
== EXCEPTION ==
<type 'exceptions.TypeError'>:getReplicas() got an unexpected keyword argument 'allStatus'
  File "/localdisk/dirac/work/job/Wrapper/Wrapper_896255", line 85, in execute
    result = job.resolveInputData()
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/WorkloadManagementSystem/JobWrapper/JobWrapper.py", line 539, in resolveInputData
    result = self.__checkFileCatalog( lfns )
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/WorkloadManagementSystem/JobWrapper/JobWrapper.py", line 590, in __checkFileCatalog
    replicas = self.__getReplicaMetadata( lfns )
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/WorkloadManagementSystem/JobWrapper/JobWrapper.py", line 615, in __getReplicaMetadata
    repsResult = self.rm.getReplicas( lfns )
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/DataManagementSystem/Client/ReplicaManager.py", line 2257, in getReplicas
    return self.getCatalogReplicas( lfn )
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/DataManagementSystem/Client/ReplicaManager.py", line 159, in getCatalogReplicas
    return self._executeFileCatalogFunction( lfn, 'getReplicas', argsDict = {'allStatus':allStatus}, catalogs = catalogs )
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/DataManagementSystem/Client/ReplicaManager.py", line 98, in _executeFileCatalogFunction
    exec( execString )
  File "<string>", line 1, in <module>
  File "/mnt/sw-real/dirac/certification/pro/DIRAC/Resources/Catalog/FileCatalog.py", line 129, in r_execute
    res = method( *parms, **kws )

Failure during Input Data Resolution
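The traceback above comes down to a keyword argument being forwarded to a catalog whose getReplicas does not declare it. Here is a minimal sketch of that mechanism. The class and function names are hypothetical stand-ins, not the actual DIRAC code, and the forwarding helper only imitates the `method( *parms, **kws )` call seen in the last frame.

```python
# Hypothetical stand-ins for two catalog plugins; NOT the real DIRAC classes.
class CatalogWithAllStatus:
    def getReplicas(self, lfns, allStatus=False):
        # Returns a DIRAC-style result dictionary with empty replica info.
        return {"OK": True,
                "Value": {"Successful": {lfn: {} for lfn in lfns}, "Failed": {}}}


class CatalogWithoutAllStatus:
    def getReplicas(self, lfns):
        # Older-style signature that does not know about allStatus.
        return {"OK": True,
                "Value": {"Successful": {lfn: {} for lfn in lfns}, "Failed": {}}}


def call_catalog(catalog, method_name, lfns, **kws):
    """Forward the call and its keyword arguments to the named method."""
    method = getattr(catalog, method_name)
    return method(lfns, **kws)


def try_get_replicas(catalog, lfns):
    """Return None on success, or the TypeError message on a signature mismatch."""
    try:
        call_catalog(catalog, "getReplicas", lfns, allStatus=False)
        return None
    except TypeError as exc:
        return str(exc)


if __name__ == "__main__":
    lfns = ["/some/hypothetical/lfn.raw"]
    print(try_get_replicas(CatalogWithAllStatus(), lfns))
    print(try_get_replicas(CatalogWithoutAllStatus(), lfns))
```

With this layout, a single catalog in the configured list that lacks the keyword is enough to make the whole call fail, even if every other catalog accepts it.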

graciani commented 12 years ago

Hi,

Both the DIRACFileCatalog and LcgFileCatalog getReplicas methods include "allStatus" in their definitions:

  def getReplicas(self, lfns, allStatus=False, rpc='', url='', timeout=120):
    rpcClient = self._getRPC(rpc=rpc, url=url, timeout=timeout)
    return rpcClient.getReplicas(lfns, allStatus)

  def getReplicas( self, lfn, allStatus = False ):
    """ Returns replicas for an LFN or list of LFNs
    """
    res = self.__checkArgumentFormat( lfn )

Which File Catalogs are defined in the instance you are testing?

Ricardo
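One way to answer that question mechanically is to inspect each configured catalog object and report those whose getReplicas cannot take the keyword. The sketch below is an assumption-laden illustration: the catalog classes and the `catalogs` dictionary are hypothetical, and it uses `inspect.signature`, which needs Python 3 (the installation in this thread runs Python 2.6, where `inspect.getargspec` would be the closest equivalent).

```python
import inspect


def accepts_keyword(func, name):
    """Return True if func can be called with a keyword argument called name."""
    params = inspect.signature(func).parameters
    param = params.get(name)
    if param is not None and param.kind in (
        inspect.Parameter.POSITIONAL_OR_KEYWORD,
        inspect.Parameter.KEYWORD_ONLY,
    ):
        return True
    # A **kwargs catch-all also accepts any keyword.
    return any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values())


# Hypothetical catalog plugins used only for illustration.
class FullCatalog:
    def getReplicas(self, lfn, allStatus=False):
        return {"OK": True, "Value": {"Successful": {}, "Failed": {}}}


class LegacyCatalog:
    def getReplicas(self, lfn):
        return {"OK": True, "Value": {"Successful": {}, "Failed": {}}}


def catalogs_missing_keyword(catalogs, method_name, keyword):
    """List the names of catalogs whose method does not accept the keyword."""
    return sorted(
        name
        for name, catalog in catalogs.items()
        if not accepts_keyword(getattr(catalog, method_name), keyword)
    )


if __name__ == "__main__":
    catalogs = {"Full": FullCatalog(), "Legacy": LegacyCatalog()}
    print(catalogs_missing_keyword(catalogs, "getReplicas", "allStatus"))
```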

closier commented 12 years ago

Hi Ricardo,

OK, there was a RAWIntegrity catalog in the local dirac.cfg. I removed it, and now I get a "Bad credentials" error, although I am using your credentials and it worked last week.

Any idea?

2012-02-13 10:52:57 UTC WorkloadManagement/JobAgent[gHjg] INFO: Payload proxy information:
subject : /DC=es/DC=irisgrid/O=ecm-ub/CN=Ricardo-Graciani-Diaz/CN=proxy/CN=proxy/CN=limited proxy
issuer : /DC=es/DC=irisgrid/O=ecm-ub/CN=Ricardo-Graciani-Diaz/CN=proxy/CN=proxy
identity : /DC=es/DC=irisgrid/O=ecm-ub/CN=Ricardo-Graciani-Diaz
timeleft : 166:03:06
DIRAC group : lhcb_data
path : /tmp/tmpu5_QPX
username : rgracian
properties : NormalUser, JobSharing, ProductionManagement, LimitedDelegation
VOMS : True
VOMS fqan : ['/lhcb/Role=production']
2012-02-13 10:52:57 UTC WorkloadManagement/JobAgent[gHjg] VERB: Starting process for monitoring payload proxy
2012-02-13 10:52:57 UTC WorkloadManagement/JobAgent[gHjg] VERB: CE submission command: /localdisk/dirac/work/job/Wrapper/Job896409
2012-02-13 10:52:57 UTC WorkloadManagement/JobAgent/Subprocess[gHjg] DEBUG: Timeout set to 0
Trying to load from root path LHCbDIRAC
Trying to load from root path DIRAC

Trying to load from root path LHCbDIRAC

SVN version $Id$ v7r2p10 Developer tag: 2
Job Wrapper started under PID: 19257
Job Wrapper parent process group ID: 19072
Job Wrapper process group ID: 19257

sys.path is:
/mnt/sw-real/dirac/certification/pro
/localdisk/dirac/work/job/Wrapper
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/site-packages/setuptools-0.6c11-py2.6.egg
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/site-packages/distribute-0.6.15-py2.6.egg
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/site-packages/pip-0.8.3-py2.6.egg
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/site-packages/GSI-0.5.0-py2.6-linux-x86_64.egg
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154
/mnt/sw-real/dirac/certification/pro
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python26.zip
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/plat-linux2
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/lib-tk
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/lib-old
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/lib-dynload
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib/python2.6/site-packages

PYTHONPATH is:
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154
/mnt/sw-real/dirac/certification/pro

LD_LIBRARY_PATH is:
/mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/Linux_x86_64_glibc-2.5/lib
/mnt/sw-real/dirac/certification/pro/Linux_x86_64_glibc-2.5/lib
/mnt/sw-real/dirac/certification/pro/Linux_x86_64_glibc-2.5/lib/mysql

setJobStatus(896409,Running,Job Initialization,JobWrapper) Starting Job Wrapper Initialization for Job 896409 {'ParametricInputData': '', 'ProductionOutputData': ['/lhcb/validation/Collision11/SDST/00000652/0000/00000652_00000206_1.sdst', '/lhcb/validation/Collision11/HIST/00000652/0000/Brunel_00000652_00000206_1_Hist.root', '/lhcb/validation/Collision11/HIST/00000652/0000/DaVinci_00000652_00000206_2_Hist.root'], 'SoftwareDistModule': 'LHCbDIRAC.Core.Utilities.CombinedSoftwareInstallation', 'InputDataType': '', 'DebugLFNs': ['/lhcb/debug/Collision11/SDST/00000652/0000/00000652_00000206_1.sdst', '/lhcb/debug/Collision11/HIST/00000652/0000/Brunel_00000652_00000206_1_Hist.root', '/lhcb/debug/Collision11/DST/00000652/0000/00000652_00000206_2.dst', '/lhcb/debug/Collision11/HIST/00000652/0000/DaVinci_00000652_00000206_2_Hist.root', '/lhcb/debug/Collision11/CORE/00000652/0000/00000206_core'], 'Status': 'Created', 'VirtualOrganization': 'lhcb', 'LogLevel': 'verbose', 'InputDataPolicy': 'DIRAC.WorkloadManagementSystem.Client.DownloadInputData', 'BookkeepingLFNs': ['/lhcb/validation/Collision11/SDST/00000652/0000/00000652_00000206_1.sdst', '/lhcb/validation/Collision11/HIST/00000652/0000/Brunel_00000652_00000206_1_Hist.root', '/lhcb/validation/Collision11/DST/00000652/0000/00000652_00000206_2.dst', '/lhcb/validation/Collision11/HIST/00000652/0000/DaVinci_00000652_00000206_2_Hist.root'], 'JobType': 'DataReprocessing', 'GridEnv': '/afs/cern.ch/project/gd/LCG-share/3.2.8-0/etc/profile.d/grid-env', 'StdError': 'std.err', 'RunNumber': '89338', 'OwnerDN': '/DC=es/DC=irisgrid/O=ecm-ub/CN=Ricardo-Graciani-Diaz', 'JobGroup': '00000652', 'StdOutput': 'std.out', 'JobID': '896409', 'Origin': 'DIRAC', 'SystemConfig': 'x86_64-slc5-gcc43-opt', 'MaxCPUTime': '1000000', 'LogFilePath': '/lhcb/validation/Collision11/LOG/00000652/0000/00000206', 'LogTargetPath': '/lhcb/validation/Collision11/LOG/00000652/0000/00000652_00000206.tar', 'Executable': '$DIRACROOT/scripts/dirac-jobexec', 'JobName': 
'00000652_00000206', 'Priority': '7', 'JobRequirements': ['[', 'OwnerDN', '=', '/DC=es/DC=irisgrid/O=ecm-ub/CN=Ricardo-Graciani-Diaz;', 'VirtualOrganization', '=', 'lhcb;', 'Setup', '=', 'LHCb-Certification;', 'Sites', '=', 'LCG.CERN.ch', 'LCG.CSCS.ch', 'DIRAC.ONLINE.ch', 'DIRAC.CERNVMTest.ch', ';', 'CPUTime', '=', '1000000;', 'OwnerGroup', '=', 'lhcb_data;', 'UserPriority', '=', '7;', 'JobTypes', '=', 'DataReprocessing', ';', ']'], 'Arguments': 'jobDescription.xml -o LogLevel=verbose', 'SoftwarePackages': ['AppConfig.v3r110', 'Brunel.v41r1', 'AppConfig.v3r106', 'DaVinci.v29r0'], 'InputDataModule': 'LHCbDIRAC.Core.Utilities.InputDataResolution', 'InputSandbox': ['jobDescription.xml', 'SB:CertificationSandboxSE|/SandBox/l/lhcb_data/eb7/9fd/eb79fd1ee4b6e30e88c0b786e9875a22.tar.bz2'], 'OwnerName': 'rgracian', 'OutputSandbox': ['std.err', 'std.out'], 'TargetSE': 'CNAF-RAW', 'TransformationID': '652', 'DIRACSetup': 'LHCb-Certification', 'ProcessingType': 'Reco12', 'ParametricInputSandbox': '', 'CPUTime': '86400', 'TaskID': '206', 'Site': 'LCG.CERN.ch,LCG.CSCS.ch,DIRAC.ONLINE.ch,DIRAC.CERNVMTest.ch', 'OwnerGroup': 'lhcb_data', 'Owner': 'rgracian', 'InputData': '/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw'} {'Requirements': 'True', 'LocalArea': '/localdisk/dirac/work', 'CPUScalingFactor': '4.0', 'InputDataDirectory': '/localdisk/dirac/work', 'TotalCPUs': '1', 'ControlDirectory': '/localdisk/dirac/work/WorkloadManagement/JobAgent', 'WorkDirectory': '/localdisk/dirac/work', 'LogLevel': 'INFO', 'LHCbPlatform': 'x86_64-slc5-gcc43-opt', 'SharedArea': '/mnt/sw-real/lib', 'DIRACVersion': 'v7r2p7', 'GridEnv': '/afs/cern.ch/project/gd/LCG-share/3.2.8-0/etc/profile.d/grid-env', 'WorkingDirectory': '/localdisk/dirac/work', 'InstancePath': '/mnt/sw-real/dirac/certification/', 'ReleaseVersion': 'v7r2p7', 'CPUTime': '3000000', 'MaxWaitingJobs': '1', 'FileCatalog': 'LcgFileCatalogCombined', 'Architecture': 'x86_64-slc5-gcc43-opt', 'Root': 
'/mnt/sw-real/dirac/certification/pro', 'EnableAgentMonitoring': 'yes', 'Site': 'DIRAC.ONLINE.ch', 'LocalAccountString': 'lhcbprod', 'MaxCPUTime': '3000000', 'MaxTotalJobs': '1'} setJobParameters(896409,[('CPUScalingFactor', '4.0'), ('PilotAgent', 'v7r2p10'), ('JobWrapperPID', 19257)]) setJobStatus(896409,Running,Downloading InputSandbox,JobWrapper) Downloading InputSandbox for job 896409: jobDescription.xml Downloading Input SandBox SB:CertificationSandboxSE|/SandBox/l/lhcb_data/eb7/9fd/eb79fd1ee4b6e30e88c0b786e9875a22.tar.bz2

Trying to load from root path LHCbDIRAC Trying to load from root path DIRAC Trying to load from root path LHCbDIRAC Trying to load from root path LHCbDIRAC Looking for file /mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/LHCbDIRAC/Resources/Storage/DIPStorage.py

Trying to load from root path DIRAC Looking for file /mnt/sw-real/dirac/certification/versions/v7r2p10_1329122154/DIRAC/Resources/Storage/DIPStorage.py

StorageElement.isValid: Determining whether the StorageElement CertificationSandboxSE is valid for use. StorageElement.executeFunction: Attempting to perform 'getFile' operation with 1 pfns. StorageElement.isValid: Determining whether the StorageElement CertificationSandboxSE is valid for use. StorageElement.isLocalSE: Determining whether CertificationSandboxSE is a local SE. StorageElement.executeFunction: Generating 1 protocol PFNs for DIP. StorageElement.executeFunction: Attempting to perform 'getFile' for 1 physical files. Total size of input sandbox: 0.03 MiB (30558 bytes) setJobStatus(896409,Running,Input Data Resolution,JobWrapper) Input Data is: /lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw Job input data requirement is /lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw Job input data resolution policy module is LHCbDIRAC.Core.Utilities.InputDataResolution Site has the following local SEs: CERN-FREEZER, CERN-RAW, CERN_M-DST, CERN-DST, CERN-USER, CERN-FAILOVER, CERN-disk, CERN-tape, CERN-RDST, CERN_MC_M-DST, CERN-HIST, CERN-ARCHIVE, CERN_MC-DST Found optimizer catalogue result {'OK': True, 'Value': {'Successful': {'/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw': {'CERN-RAW': 'srm://srm-lhcb.cern.ch/castor/cern.ch/grid/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw', 'CNAF-RAW': 'srm://storm-fe-lhcb.cr.cnaf.infn.it/t1d0/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw'}}, 'Failed': {}}} {'OK': True, 'Value': {'Successful': {'/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw': {'CERN-RAW': 'srm://srm-lhcb.cern.ch/castor/cern.ch/grid/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw', 'CNAF-RAW': 'srm://storm-fe-lhcb.cr.cnaf.infn.it/t1d0/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw'}}, 'Failed': {}}} The following file(s) were found not to have a GUID: 
/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw
{'OK': True, 'Value': {'Successful': {'/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw': {'CERN-RAW': 'srm://srm-lhcb.cern.ch/castor/cern.ch/grid/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw', 'CNAF-RAW': 'srm://storm-fe-lhcb.cr.cnaf.infn.it/t1d0/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw'}}, 'Failed': {}}}
Could not obtain replica information from Optimizer File Catalog information
{'Message': 'File metadata is not available', 'OK': False}
Trying to load from root path LHCbDIRAC
Trying to load from root path DIRAC
Trying to load from root path LHCbDIRAC
JC res = fileCatalog.getReplicas(lfns, allStatus=False)
Replica Lookup Time: 0.10 seconds
Job Wrapper found 1 problematic LFN(s) for job 896409
LFN:/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw Problem: Bad credentials
setJobParameter(896409,MissingLFNs,LFN:/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw Problem: Bad credentials)
Could not obtain replica information from File Catalog directly
{'Message': 'Input Data Not Available', 'OK': False}
Input Data Not Available
JobWrapper failed to resolve input data
== EXCEPTION ==
<class '__main__.JobWrapperError'>:Input Data Not Available
  File "/localdisk/dirac/work/job/Wrapper/Wrapper_896409", line 88, in execute
    raise JobWrapperError( result['Message'] )

Failure during Input Data Resolution
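The log above prints result dictionaries of the form {'OK': ..., 'Value': {'Successful': ..., 'Failed': ...}} and then reports one problematic LFN. As a rough sketch of how such a result could be turned into a list of problems, assuming the per-file error ends up under 'Failed' (the log does not show the raw catalog reply, so this is a guess) and using a hypothetical helper name:

```python
def problematic_lfns(result):
    """Map each failing LFN to its error message.

    Assumes a result shaped like the dictionaries printed in the log:
    {'OK': bool, 'Value': {'Successful': {...}, 'Failed': {...}}}
    or {'OK': False, 'Message': str} when the whole call failed.
    """
    if not result.get("OK"):
        # The call itself failed, so there is no per-file information.
        return {"<all>": result.get("Message", "unknown error")}
    return dict(result["Value"].get("Failed", {}))


if __name__ == "__main__":
    lfn = "/lhcb/data/2011/RAW/FULL/LHCb/COLLISION11/89338/089338_0000000082.raw"
    # Hypothetical reply; the real reply is not shown in the log.
    reply = {"OK": True,
             "Value": {"Successful": {}, "Failed": {lfn: "Bad credentials"}}}
    for name, problem in problematic_lfns(reply).items():
        print("LFN:%s Problem: %s" % (name, problem))
```

Keeping the whole-call failure separate from per-file failures makes it easier to tell a credentials or configuration problem from a genuinely missing replica.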


graciani commented 12 years ago

This is a configuration issue specific to a given LHCb installation. I am closing it here and will answer you by mail.