cluster « hadoop « Java Database Q&A





1. Hadoop on windows server    stackoverflow.com

I'm thinking about using hadoop to process large text files on my existing windows 2003 servers (about 10 quad core machines with 16gb of RAM) The questions are:

  1. Is there any good tutorial ...

2. Hadoop cluster. 2 Fast, 4 Medium, 8 slower machines?    stackoverflow.com

We're going to purchase some new hardware to use just for a Hadoop cluster and we're stuck on what we should purchase. Say we have a budget of $5k should we ...

3. Jar works with standalone hadoop, but not on the actual cluster (java.lang.ClassNotFoundException: org.jfree.data.xy.XYDataset)    stackoverflow.com

I am trying to build my project using eclipse on windows and execute on a linux cluster. The project depends on some external jars, which I enclosed using eclipse's "Export->Runnable JAR ...

4. Running map reduce over geographically located VMs - How bad is this setup for a hadoop cluster?    stackoverflow.com

Like the subject reads, is it important that I get dedicated hardware to run a hadoop cluster and not VMs? If yes, what is acceptable network latency? Are you required to ...

5. What approximate amount of semistructured data is enough for setting up Hadoop cluster?    stackoverflow.com

I know, Hadoop is not only alternative for semistructured data processing in general — I can do many things with plain tab-separated data and a bunch of unix tools (cut, grep, ...

6. hadoop install on round-robin DNS    stackoverflow.com

I want to install hadoop on a round-robin DNS environment. I have a bunch of machines sharing a common user environment and a common name. These machines are equal. The round-robin ...

7. Running a mapreduce jar on Hadoop cluster    stackoverflow.com

I'm trying to run the map reduce implementation of quadratic sieve algorithm on Hadoop. For this purpose I'm using karmasphere Hadoop community plugin with Netbeans. The program works fine using the ...

8. Hadoop cluster with some nodes of low disk space~    stackoverflow.com

i'm now holding a cluster with 12 nodes. Some of them, specifically 8 nodes, have enough disk space. But the other 4 only have little space to use. However, the other 4 ...

9. hadoop multi node cluster installation in 2 machies    stackoverflow.com

Possible Duplicate:
hadoop multi node cluster installation in 2 machies
hello scp error: “Permission denied (publickey). lost connection�





10. hadoop not running in the multinode cluster    stackoverflow.com

I have a jar file "Tsp.jar" that I made myself. This same jar files executes well in single node cluster setup of hadoop. However when I run it on a ...

11. Hadoop JBOD disk configuration on HP Smart Array 410/i disk controller    stackoverflow.com

I'm in a evaluation phase of some hw that could be used for setting up a hadoop cluster. This hw is refurbished (hp G6 servers w/ Smart Array 410/i ...

12. Too many fetch failures: Hadoop on cluster (x2)    stackoverflow.com

I have been using Hadoop for the last week or so (trying to get to grips with it), and although I have been able to set up a multinode cluster (2 ...

13. Running web-fetches from within a Hadoop cluster    stackoverflow.com

A blog post - http://petewarden.typepad.com/searchbrowser/2011/05/using-hadoop-with-external-api-calls.html - suggests calling external systems (querying the twitter API, or crawling webpages) from within a Hadoop cluster. For the system I'm currently developing, there are ...

14. Error in running Hadoop example on single node cluster    stackoverflow.com

I'm running the word count example provided by hadoop. The following is the error that i am getting:

-bash-4.1$ ./hadoop jar /home/chanders/Hadoop/bin/hadoop-examples-0.20.203.0.jar wordcount /usr/temp_hadoop /usr/output
Warning: Maximum heap size rounded up to 1024 ...

15. Hadoop cluster on NFS    stackoverflow.com

I'm trying to setup a hadoop cluster on 5 machines on same lan with NFS. The problem im facing is that the copy of hadoop on one machine is replicated on ...

16. How do I add a node to a pseudo-distributed hadoop setup?    stackoverflow.com

I've a single-node pseudo-distributed hadoop setup on a unix system in the network. What are the minimum steps to add another computer/node (cygwin) on the network to form a ...





17. Hadoop (HDFS) for images storing    stackoverflow.com

I encountered lately with the well-known software-stack called "Hadoop". Hadoop supply a distributed file system called HDFS. now, what I need is a system to store mass of images and other media-files read actions ...

18. Facing problems in setting up a multinode hadoop cluster (ubuntu)    stackoverflow.com

Possible Duplicate:
How can i connect to a remote host using ssh?
I am new to Hadoop and trying to learn and use it. I have ...

19. Why job with mappers only is so slow in real cluster?    stackoverflow.com

I have a job with mapper PrepareData only which needed for converting text data to SequencialFile with VLongWritable as a key and DoubleArrayWritable as a value. When I run it over 455000x90 ...

20. How to add jars into the classpath and get effected without restarting the hadoop cluster?    stackoverflow.com

I wrote some mapreduce jobs that reference a few external jars. so I added them into the CLASSPATH of the "running" cluster in order to run jobs. Once I tried to run them, ...

21. Hadoop Framework to simulate a cluster locally    stackoverflow.com

Is there any Hadoop framework wherein I can simulate a full cluster locally on a Linux or Windows Machine? The DFS could replicate the data locally and the Compute layer would ...

22. Hadoop vs simple cluster    coderanch.com

Yes, you have to rewrite your program to make it work over a cluster (distributed computing). Hadoop is a framework that makes the rewriting easier. In my book, I gave an example of writing a word counting program. Writing such a program to run on a single machine is easy. Writing it to run on a cluster of machines introduces a ...