I'm thinking about building a small testing application in hadoop to get the hang of the system.
The application I have in mind will be in the realm of doing statistics.
I want ...
This is how Hadoop currently works: If a reducer fails (throws a NullPointerException for example), Hadoop will reschedule another reducer to do the task of the reducer that failed.
Is it possible ...
I have a simple map-reduce program in which my map and reduce primitives look like this
map(K,V) = (Text, OutputAggregator)
reduce(Text, OutputAggregator) = (Text,Text)
The important point is that from my map function I ...
Hi
Text manipulation in Reduce phase seems not working correctly.
I suspect problem could be in my code rather then hadoop itself but you never know...
If you can spot any gotchas let ...
I need to access the counters from my mapper in my reducer. Is this possible? If so how is it done?
As an example:
my mapper is:
public class CounterMapper extends Mapper ...
I have an interesting problem that I'm struggling to fit in MapReduce. I have a bunch of log entries. What I need to do is something like this:
Check if any entry ...
I have a mapreduce java program in which I try to only compress the mapper output but not the reducer output. I thought that this would be possible by setting ...
i am trying to pass a complex writable between mapper and reducer, more specifically ArrayWritable of ObjectWritables.
public class ObjectArrayWritable extends ArrayWritable {
...
I am trying to find a way to "loop" my reducer, for example:
for(String document: tempFrequencies.keySet())
{
if(list.get(0).equals(document))
{
testMap.put(key.toString(), DF.format(tfIDF));
}
}
//This allows me to create a hashmap which i plan to write out to context as ...
I've got a job that uses 100 reducers config'ed with
setOutputFormat (SequenceFileOutputFormat.class);
After the job runs, can I combine all of the part files via
the following command and have things work correctly with
the ...
I am fairly new to hadoop, however, I've been reading "Hadoop: The definitive guide", so I think I have an understanding of the basic concepts.
I used Hadoop 0.20.2 to run a ...
I have a basic scenario in Hadoop:
All mappers send all values to the same key. Therefore all values end up on the same reducer.
However, when I iterate the values in the ...