nexr / RHive

RHive is an R extension facilitating distributed computing via Apache Hive.
http://nexr.github.io/RHive
122 stars 63 forks source link

RHive query results transfer extremely slow #50

Open haltux opened 11 years ago

haltux commented 11 years ago

Hello,

When I do a simple rhive.query("select * from X limit 10000")

it takes 90s to answer once the query is completed on the hiveserver (OK displayed on the console).

It increases linearily with data size, always exactly 9 ms per line, it does not depend on the line length.

It is several order of magnitude slower than any other kind of data transfer between R and whatever. My guess is that there is some kind of timeout somewhere.

alephomega commented 11 years ago

rhive.query function was badly designed. It incur growing object and growing object is so slow in R. rhive.query function will be redesigned.

balnagy commented 10 years ago

+1 ;)

haltux commented 10 years ago

FYI, using RJDBC with HiveServer2 is also painfully slow. Less slow than RHive, but still extremely slow, much slower than with other JDBC drivers. So as I guess the problems comes from HiveServer.

Right now the acceptable way to transfer data from Hive to R is to export data from Hive to a local CSV file and to load it using data.table's fread.

birdplane-industries commented 9 years ago

A simple solution I came up with involves simply piping your Hive query to the command line. Then you can use readLines() and separate out fields based on the '\t' delimiter, creating a data.table out of your query results.

require(data.table) require(tidyr)

hive_pipe <- pipe("hive -e 'SELECT my, table, columns FROM my_table'") results <- data.table(readLines(hive_pipe)) close(hive_pipe) results <- separate(results, "V1", into = c(my, table, columns), sep = "\t")

This is significantly faster than rhive.query. And it doesn't require writing out your query results to a CSV file first.

If I have the time, I may fork rhive and add this solution to my version.

imanopholist commented 8 years ago

Hi sherath21

Can you please explain how did you connect from RStudio to Hive CLI. Because while trying to proceed as you did I got an error on line : results=data.table(readLines(hive_pipe)). The error says : _sh: 1: hive: not found _ .

My code is the following :

Call necessary libraries

library(rJava) library(Rserve) library(RHive)

Intitialize rhive

rhive.init( hiveLib="/usr/local/hive/lib", hiveHome = "/usr/local/hive/", hadoopHome="/usr/local/hadoop-2.7.2/", hadoopConf="/usr/local/hadoop-2.7.2/etc/hadoop", hadoopLib="/usr/local/hadoop-2.7.2/lib",verbose=FALSE )

library(data.table)

Establish the connection

rhive.connect("10.0.2.15",defaultFS="hdfs://localhost:9000")

hive_pipe=pipe("hive -e 'USE hello_db; SELECT * FROM table_txt limit 10'") results = data.table(readLines(hive_pipe)) close(hive_pipe)

Any idea ?

Thanks

birdplane-industries commented 8 years ago

I think you may be missing the fact that my solution is a way to bypass RHive altogether. I'm just piping a "hive -e" shell command from R to the command line. Your error suggests the hive command is not recognized in the environment you're piping "hive -e" to.

On Tue, May 10, 2016 at 2:26 PM, imanopholist notifications@github.com wrote:

Hi sherath21

Can you please explain how did you connect from RStudio to Hive CLI. Because while trying to proceed as you did I got an error on line : results=data.table(readLines(hive_pipe)). The error says : _sh: 1: hive: not found _ .

My code is the following :

Call necessary libraries

library(rJava) library(Rserve) library(RHive)

Intitialize rhive

rhive.init( hiveLib="/usr/local/hive/lib", hiveHome = "/usr/local/hive/", hadoopHome="/usr/local/hadoop-2.7.2/", hadoopConf="/usr/local/hadoop-2.7.2/etc/hadoop", hadoopLib="/usr/local/hadoop-2.7.2/lib",verbose=FALSE )

library(data.table)

Establish the connection

rhive.connect("10.0.2.15",defaultFS="hdfs://localhost:9000")

Define a hive query

hive_pipe=pipe("hive -e 'USE hello_db; SELECT * FROM table_txt limit 10'")

Execute hive query in hive commande line

Read line by line the result and store it in a data table

-- Note: Time to fetch 1000001 is 2 sec instead of hours with rhive --

results = data.table(readLines(hive_pipe))

close hive conenction

close(hive_pipe)

Any idea ?

Thanks

— You are receiving this because you commented. Reply to this email directly or view it on GitHub https://github.com/nexr/RHive/issues/50#issuecomment-218156415

imanopholist commented 8 years ago

Hi,

library(data.table) hive_pipe=pipe("hive -e 'USE hello_db; SELECT * FROM table_txt limit 10'") results = data.table(readLines(hive_pipe)) close(hive_pipe)

If thinking about RStudio ignore the paths or rstudio user: _Sys.getenv() _ HADOOP_CMD /usr/local/hadoop-2.7.2/bin/hadoop HADOOP_HOME /usr/local/hadoop-2.7.2 HADOOP_STREAMING /usr/local/hadoop-2.7.2/share/hadoop/tools/lib/hadoopstreaming-2.7.2.jar HIVE_HOME /usr/local/hive-1.2.1 HOME /home/hadoopuser LOGNAME hadoopuser RSTUDIO_USER_IDENTITY hadoopuser

ghost commented 8 years ago

@imanopholist For this reason, RHive has a function rhive.big.query. Same functionality as rhive.query, but input query executes with CreateTableAsSelect format. So the results are stored in the temp table in Hive, then load into HDFS or Local Directory, at last read data from those files (see: rhive.load.table2 function). Full codes are in the RHive/R/rhive.R.

Thanks.

imanopholist commented 8 years ago

@DrakeMin thanks, I'll try that right away !

imanopholist commented 8 years ago

Hi @DrakeMin, I've tried using rhive.big.query. but I am getting this error : Erreur : java.sql.SQLException: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask. This error doesn't block the creation of the new table in the database. I also tried to load the new created table by using rhive.load.table2 but I am getting the same error. I searched on the net, and I found that I need to assure having rhive-2.0.4. I've installed it as mentioned in this link https://github.com/nexr/RHive by @ssshow16 , but still getting the same error. On hiveserver I find that the source of error is: Failed with exception Unable to move source hdfs://localhost:9000/rhive/tmp/hadoopuser/7a9dac451c09d16c9dafe31cdf081d/.hive staging_hive_2016-05-15_17-08-03_055_4204045923460424200-5/-ext-10000 to destination /rhive/tmp/hadoopuser/7a9dac451c09d16c9dafe31cdf081d

any ideas ?

I just want to add, that when fetching a small table it works with big.query and no need to use load.table2.

Thank you

imanopholist commented 8 years ago

Hi, I noticed that I can execute @sherath21 solution on R console and not on RStudio (desktop/server) so I verified the variable environment in both and find that there is some differences. but the main one was that even if RStudio had its RSTUDIO_USER_IDENTITY , I had to add the variable USER and give it the name of RStudio user. and that fixed the problem. but unfortunately still didn't get any luck with the second method.

ghost commented 8 years ago

@imanopholist Can you provide the full log of hive-server and/or related MR job(related Hive movetask) ? Thank you.

imanopholist commented 8 years ago

Hi @DrakeMin thanks for responding. the hiveserver2 result while executing rhive functions is : **#rhive.connect(ip,defaultFS="hdfs://localhost:9000")

converting to local hdfs://localhost:9000/rhive/lib/2.0-0.4/rhive_udf.jar Added [/tmp/ed1b5743-0b9b-47f9-8e72-1b38238c4ab3_resources/rhive_udf.jar] to class path Added resources: [hdfs://localhost:9000/rhive/lib/2.0-0.4/rhive_udf.jar] OK OK OK OK OK OK OK OK OK OK

rhive.load.table2 (added comment not in result)-->

Query ID = hadoopuser_20160516091619_2c275aec-9cf1-4597-aca1-9b209d1fd45c Total jobs = 3 Launching Job 1 out of 3 Number of reduce tasks is set to 0 since there's no reduce operator Starting Job = job_1463321987231_0009, Tracking URL = http://slave1:8088/proxy/application_1463321987231_0009/ Kill Command = /usr/local/hadoop-2.7.2/bin/hadoop job -kill job_1463321987231_0009 Hadoop job information for Stage-1: number of mappers: 1; number of reducers: 0 2016-05-16 09:16:30,113 Stage-1 map = 0%, reduce = 0% 2016-05-16 09:16:41,786 Stage-1 map = 100%, reduce = 0%, Cumulative CPU 3.13 sec MapReduce Total cumulative CPU time: 3 seconds 130 msec Ended Job = job_1463321987231_0009 Stage-3 is selected by condition resolver. Stage-2 is filtered out by condition resolver. Stage-4 is filtered out by condition resolver. Moving data to: hdfs://localhost:9000/rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6/.hive-staging_hive_2016-05-16_09-16-19_626_4167550319352383126-1/-ext-10000 Moving data to: /rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6 Failed with exception Unable to move source hdfs://localhost:9000/rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6/.hive-staging_hive_2016-05-16_09-16-19_626_4167550319352383126-1/-ext-10000 to destination /rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6 FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask MapReduce Jobs Launched: Stage-Stage-1: Map: 1 Cumulative CPU: 3.13 sec HDFS Read: 58672091 HDFS Write: 58667652 SUCCESS Total MapReduce CPU Time Spent: 3 seconds 130 msec**

ghost commented 8 years ago

@imanopholist hmm. It's weird. The query is complete, temp data is stored in hdfs://localhost:9000/rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6/.hive-staging_hive_2016-05-16_09-16-19_626_4167550319352383126-1/-ext-10000, then move to original location(hdfs://localhost:9000/rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6).

How about the HDFS Namenode log at that time? I think move task will be a hdfs job, so if move failed, HDFS logs may have an error.

imanopholist commented 8 years ago

@DrakeMin I've only used rhive.load.table2 and I run it at 16:28. the log is a little bit long. I guess the error comes because the created files are under user : anonymous permission and not hadoopuser. If you agree with me can you please show me how to change it from anonymous to hadoopuser.

Log :

**2016-05-16 16:27:12,770 INFO logs: Aliases are enabled 2016-05-16 16:27:36,418 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,420 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,421 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,425 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,428 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,436 INFO org.apache.hadoop.hdfs.server.namenode.FSEditLog: Number of transactions: 4 Total time for transactions(ms): 12 Number of transactions batched in Syncs: 0 Number of syncs: 4 SyncTimes(ms): 56 2016-05-16 16:27:36,440 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,442 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,443 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,449 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,635 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:27:36,652 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,478 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,480 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,481 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,482 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,515 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,663 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,679 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,693 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,698 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:02,700 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,218 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,221 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,222 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,247 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,248 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,258 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,274 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,423 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,427 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742500_1676{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hive/anonymous/cb8c7b37-3b8f-41cc-a9f6-fc30dd6556ab/hive_2016-05-16_16-28-02_030_8915384368329738261-2/-mr-10004/a4c40b6a-50ef-4e7a-bc45-f4d0c6f81ecf/map.xml 2016-05-16 16:28:03,737 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:03,737 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: BLOCK* blk_1073742500_1676{UCState=COMMITTED, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} is not COMPLETE (ucState = COMMITTED, replication# = 0 < minimum = 1) in file /tmp/hive/anonymous/cb8c7b37-3b8f-41cc-a9f6-fc30dd6556ab/hive_2016-05-16_16-28-02_030_8915384368329738261-2/-mr-10004/a4c40b6a-50ef-4e7a-bc45-f4d0c6f81ecf/map.xml 2016-05-16 16:28:03,796 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742500_1676{UCState=COMMITTED, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 4190 2016-05-16 16:28:04,146 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,148 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hive/anonymous/cb8c7b37-3b8f-41cc-a9f6-fc30dd6556ab/hive_2016-05-16_16-28-02_030_8915384368329738261-2/-mr-10004/a4c40b6a-50ef-4e7a-bc45-f4d0c6f81ecf/map.xml is closed by DFSClient_NONMAPREDUCE_1022403887_28 2016-05-16 16:28:04,154 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,154 INFO org.apache.hadoop.hdfs.server.blockmanagement.BlockManager: Increasing replication from 1 to 10 for /tmp/hive/anonymous/cb8c7b37-3b8f-41cc-a9f6-fc30dd6556ab/hive_2016-05-16_16-28-02_030_8915384368329738261-2/-mr-10004/a4c40b6a-50ef-4e7a-bc45-f4d0c6f81ecf/map.xml 2016-05-16 16:28:04,556 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,604 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,605 INFO org.apache.hadoop.ipc.Server: IPC Server handler 1 on 9000, call org.apache.hadoop.hdfs.protocol.ClientProtocol.getBlockLocations from 127.0.0.1:55246 Call#43 Retry#0: java.io.FileNotFoundException: File does not exist: /tmp/hive/anonymous/cb8c7b37-3b8f-41cc-a9f6-fc30dd6556ab/hive_2016-05-16_16-28-02_030_8915384368329738261-2/-mr-10004/a4c40b6a-50ef-4e7a-bc45-f4d0c6f81ecf/reduce.xml 2016-05-16 16:28:04,618 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,619 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,674 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,675 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,681 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,685 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,688 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,690 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,692 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,697 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,698 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742501_1677{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/libjars/rhive_udf.jar 2016-05-16 16:28:04,709 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742501_1677{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:04,712 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,714 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/libjars/rhive_udf.jar is closed by DFSClient_NONMAPREDUCE_1022403887_28 2016-05-16 16:28:04,715 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,715 INFO org.apache.hadoop.hdfs.server.blockmanagement.BlockManager: Increasing replication from 1 to 10 for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/libjars/rhive_udf.jar 2016-05-16 16:28:04,718 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,719 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,742 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:04,743 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742502_1678{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.jar 2016-05-16 16:28:05,119 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742502_1678{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:05,121 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,122 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.jar is closed by DFSClient_NONMAPREDUCE_1022403887_28 2016-05-16 16:28:05,123 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,124 INFO org.apache.hadoop.hdfs.server.blockmanagement.BlockManager: Increasing replication from 1 to 10 for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.jar 2016-05-16 16:28:05,126 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,130 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,131 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,132 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,133 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,135 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,136 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,137 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,138 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,165 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,166 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,172 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,174 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,188 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,192 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,195 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,195 INFO org.apache.hadoop.hdfs.server.blockmanagement.BlockManager: Increasing replication from 1 to 10 for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.split 2016-05-16 16:28:05,199 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,200 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742503_1679{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.split 2016-05-16 16:28:05,208 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742503_1679{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:05,210 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,212 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.split is closed by DFSClient_NONMAPREDUCE_1022403887_28 2016-05-16 16:28:05,213 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,215 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,219 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,220 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742504_1680{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.splitmetainfo 2016-05-16 16:28:05,226 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742504_1680{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:05,228 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,230 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.splitmetainfo is closed by DFSClient_NONMAPREDUCE_1022403887_28 2016-05-16 16:28:05,235 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,248 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,344 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,344 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742505_1681{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.xml 2016-05-16 16:28:05,457 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742505_1681{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:05,459 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,460 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job.xml is closed by DFSClient_NONMAPREDUCE_1022403887_28 2016-05-16 16:28:05,549 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,613 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,614 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,619 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,620 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,626 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,628 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,630 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,631 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,633 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,635 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,636 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,639 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,641 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,643 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,644 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:05,646 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:07,870 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:07,899 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,072 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,078 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,121 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,123 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,356 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,357 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,368 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,369 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,385 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:08,386 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,321 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,354 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,355 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,356 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,500 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,503 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,506 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,956 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:11,962 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:14,564 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:14,657 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:14,886 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:14,886 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742506_1682{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job_1463408585018_0001_1_conf.xml 2016-05-16 16:28:15,170 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742506_1682{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:15,172 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:15,176 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job_1463408585018_0001_1_conf.xml is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:16,769 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,770 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,785 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,786 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,789 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,792 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,812 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,813 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,815 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,816 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,817 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:16,819 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:21,378 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:21,615 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:21,982 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:22,495 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:22,534 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:22,586 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:22,749 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:22,749 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742507_1683{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6/.hive-staging_hive_2016-05-16_16-28-02_030_8915384368329738261-1/_task_tmp.-ext-10002/_tmp.000000_0 2016-05-16 16:28:26,738 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742507_1683{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:26,744 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:26,761 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /rhive/tmp/hadoopuser/9e19c70c640137d3e65e6db13803a6/.hive-staging_hive_2016-05-16_16-28-02_030_8915384368329738261-1/_task_tmp.-ext-10002/_tmp.000000_0 is closed by DFSClient_attempt_1463408585018_0001_m_000000_0_185857106_1 2016-05-16 16:28:26,767 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,027 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,176 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,177 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742508_1684{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job_1463408585018_0001_1.jhist 2016-05-16 16:28:27,206 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* fsync: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job_1463408585018_0001_1.jhist for DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,206 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,209 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,227 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,228 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/COMMIT_STARTED is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,232 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,235 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,237 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/COMMIT_SUCCESS is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,323 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742508_1684{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 13561 2016-05-16 16:28:27,326 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,329 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/anonymous/.staging/job_1463408585018_0001/job_1463408585018_0001_1.jhist is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,333 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,345 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,345 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742509_1685{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/history/done_intermediate/anonymous/job_1463408585018_0001.summary_tmp 2016-05-16 16:28:27,373 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742509_1685{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:27,374 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,376 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/history/done_intermediate/anonymous/job_1463408585018_0001.summary_tmp is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,378 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,398 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,401 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,412 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,420 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,422 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,425 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,439 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,440 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742510_1686{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/history/done_intermediate/anonymous/job_1463408585018_0001-1463408885709-anonymous-INSERT+OVERWRITE+DIRE..._20160515165942_e727%28Stage-1463408907244-1-0-SUCCEEDED-default-1463408894545.jhist_tmp 2016-05-16 16:28:27,453 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742510_1686{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:27,455 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,456 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/history/done_intermediate/anonymous/job_1463408585018_0001-1463408885709-anonymous-INSERT+OVERWRITE+DIRE..._20160515165942_e727%28Stage-1463408907244-1-0-SUCCEEDED-default-1463408894545.jhist_tmp is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,457 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,461 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,462 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,463 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,465 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,466 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,469 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,480 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,480 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1073742511_1687{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} for /tmp/hadoop-yarn/staging/history/done_intermediate/anonymous/job_1463408585018_0001_conf.xml_tmp 2016-05-16 16:28:27,498 INFO BlockStateChange: BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:50010 is added to blk_1073742511_1687{UCState=UNDER_CONSTRUCTION, truncateBlock=null, primaryNodeIndex=-1, replicas=[ReplicaUC[[DISK]DS-f349950f-3477-4727-bbca-ccf26189f6e4:NORMAL:127.0.0.1:50010|RBW]]} size 0 2016-05-16 16:28:27,500 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,501 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /tmp/hadoop-yarn/staging/history/done_intermediate/anonymous/job_1463408585018_0001_conf.xml_tmp is closed by DFSClient_NONMAPREDUCE_283905618_1 2016-05-16 16:28:27,503 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,507 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,517 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:27,520 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:28,566 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:28,604 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742502_1678 127.0.0.1:50010 2016-05-16 16:28:28,604 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742503_1679 127.0.0.1:50010 2016-05-16 16:28:28,605 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742504_1680 127.0.0.1:50010 2016-05-16 16:28:28,605 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742505_1681 127.0.0.1:50010 2016-05-16 16:28:28,605 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742508_1684 127.0.0.1:50010 2016-05-16 16:28:28,605 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742506_1682 127.0.0.1:50010 2016-05-16 16:28:28,605 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742501_1677 127.0.0.1:50010 2016-05-16 16:28:28,722 INFO BlockStateChange: BLOCK* BlockManager: ask 127.0.0.1:50010 to delete [blk_1073742501_1677, blk_1073742502_1678, blk_1073742503_1679, blk_1073742504_1680, blk_1073742505_1681, blk_1073742506_1682, blk_1073742508_1684] 2016-05-16 16:28:29,548 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,551 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,571 INFO BlockStateChange: BLOCK* addToInvalidates: blk_1073742500_1676 127.0.0.1:50010 2016-05-16 16:28:29,573 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,585 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,587 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,591 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,596 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,602 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,605 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,606 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,612 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,621 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,623 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,640 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,648 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,651 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,656 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,657 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,659 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,662 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,663 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,667 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,668 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,672 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,673 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,674 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:29,675 WARN org.apache.hadoop.security.UserGroupInformation: No groups available for user anonymous 2016-05-16 16:28:31,722 INFO BlockStateChange: BLOCK* BlockManager: ask 127.0.0.1:50010 to delete [blk_1073742500_1676]**

ghost commented 8 years ago

@imanopholist sorry, currently I have no idea for this error. I'll try at our test bed for error reproduce. Thanks.

imanopholist commented 8 years ago

@DrakeMin thank you !

imanopholist commented 8 years ago

Hi @sherath21 I noticed that when loading a table and an indexed table there is no big difference in the loading time. Ex: Time_to_load_non_indexed_table= 16.78 sec Time_to_load_indexed_table= 15.23 sec

Do you have an explanation to this?

Imane