cloudera « hadoop « Java Database Q&A





1. can i use hadoop cloudera without root access?    stackoverflow.com

a bit of a binary question (okay, not excatly) - but was wondering if one is able to configure cloudera / hadoop to run at the nodes without root shell access ...

2. Flume agent - can I specify compression like gzip or bz2?    stackoverflow.com

Is it possible to specify a compression option on a Flume agent so that the data is transferred to the collector in a compressed format? I know there are compression ...

3. Hadoop put command doing nothing!    stackoverflow.com

I am running Cloudera's distribution of Hadoop and everything is working perfectly.The hdfs contains a large number of .seq files.I need to merge the contents of all the .seq files into ...

4. Hadoop: intermediate merge failed    stackoverflow.com

I'm running into a strange issue. When I run my Hadoop job over a large dataset (>1TB compressed text files), several of the reduce tasks fail, with stacktraces like these:

java.io.IOException: Task: ...

5. Comparing using Map Reduce(Cloudera Hadoop 0.20.2) two text files of size of almost 3GB    stackoverflow.com

I'm trying to do the following in hadoop map/reduce( written in java, linux kernel OS)

  1. Text files 'rules-1' and 'rules-2' (total 3GB in size) contains some rules, each rule are seperated by ...

6. Does Cloudera Mountable HDFS provide deduplicaion    stackoverflow.com

Looking at running a HDFS based storage cluster, and looking at a simple method of using the Mountable HDFS system through the Cloudera release. The first question I ask is will this ...

7. "Child Error" in Executing stream Job on multi node Hadoop cluster (cloudera distribution CDH3u0 Hadoop 0.20.2)    stackoverflow.com

I am working on 8 node Hadoop cluster, and I am trying to execute a simple streaming Job with the specified configuration.

hadoop jar /usr/lib/hadoop-0.20/contrib/streaming/hadoop-streaming-0.20.2-cdh3u0.jar \-D mapred.map.max.tacker.failures=10 \-D mared.map.max.attempts=8 \-D mapred.skip.attempts.to.start.skipping=8 \-D ...

8. hadoop certification exam    stackoverflow.com

Is there any one here who has given the hadoop certification exam (cloudera)? I heard it is a 60 multiple choice Q&A exam. Are there any samples questions / ...

9. Connecting to Cloudera VM from my desktop    stackoverflow.com

I downloaded the Cloudera VM on my Windows 7 laptop to play around. I am trying to connect to the Hadoop instance running in the VM from Windows. I did an ...





10. Socket exception when writing Map-reduce output    stackoverflow.com

I am trying to run a map-reduce job on 15 TB of data on a 32 node Hadoop (CDH 0.20.2) Cluster. 5 TB is the amount of resulting data being sent ...

11. Running java hadoop job on local/remote cluster    stackoverflow.com

I'm trying to run hadoop job on local/remote cluster. This job in future will be executed from web application. I'm trying to execute this piece of code from eclipse:

public class TestHadoop ...

12. How to convert Cloudera Hadoop "vbox" VMDK to VirtualBox VDI    stackoverflow.com

Hi guys : I am trying to run the Cloudera Hadoop VM in Virtual box.

  • First, I noted that the download is a .vmdk file. Of course, this suffix is ...

13. Using encryption with Hadoop    stackoverflow.com

The Cloudera documentation says that Hadoop does not support on disk encryption. Would it be possible to use hardware encrypted hard drives with Hadoop?

14. What is the simplest way to connect to cloudera's vm for hadoop?    stackoverflow.com

Hi guys : I recently posted regarding ssh and accessing a hadoop (single-node, cloudera VM). There was some chatter about the core-site.xml file, and that i should look at it ...

15. How do you compile a UDF in Cloudera's distribution of Hadoop+Pig    stackoverflow.com

I am running Cloudera's VM (CH3). I've copied a simple UDF in my text editor, but I don't know how to compile it in order to be able to call it ...

16. Is cloudera desktop works with apche hadoop    stackoverflow.com

Is cloudera desktop- A gui for hadoop... works with normal apache hadoop? or is it works only with cloudera distribution for hadoop. as per the below article they said it works with ...





17. Is CDH(Cloudera Distribution for hadoop) is open source to use?    stackoverflow.com

Is CDH(Cloudera Distribution for hadoop) is open source to use or is it commercial? Any inputs on this greatly appreciated.

18. Hadoop Web Authentication using Kerberos    stackoverflow.com

I configured hadoop using kerberos, everything works fine, I can browse hdfs, submit jobs, etc. But failed http web authentication. I use hadoop-0.20.2 in cdh3u2, which supports HTTP SPNEGO. HTTP authentication related configurations ...

19. Hadoop job configuration file specification    stackoverflow.com

I have a flow of ~10 Hadoop jobs. I have sequence of XML files that configure these jobs for instance:

-conf /path/on/my/local/system/start-conf.xml
-conf /path/on/my/local/system/job-1-conf.xml
-conf /path/on/my/local/system/job-2-conf.xml
-conf /path/on/my/local/system/job-i-conf.xml
...
-conf /path/on/my/local/system/job-10-conf.xml
Let's assume I have class which is ...

20. cloudera vm hadoop Hive fails after fs.http.address ip changed    stackoverflow.com

I'm using Cloudera VM (cdh3u2) as a simulated distributed file system. In order to perform file creation and writing from a web server I changed the fs.http.address property to point to ...