com.intel.hadoop.graphbuilder.idnormalize.mapreduce
Class HashIdMR
java.lang.Object
com.intel.hadoop.graphbuilder.idnormalize.mapreduce.HashIdMR
public class HashIdMR
- extends java.lang.Object
This MapReduce class maps a list of unique vertex into 2 parts of output: A
dictionary from rawId to newId, and a new vertex data file using newId. The
domain of newId is consecutive integers from 0 to |V|-1.
Input directory: list of unique vertex data. Output directory:
- $outputdir/vidmap for rawid to newid dictionary.
- $outputdir/vdata for normalized vertex data.
Method Summary |
void |
run(java.lang.String inputpath,
java.lang.String outputpath)
|
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
HashIdMR
public HashIdMR(GraphParser graphparser,
FieldParser vidparser,
FieldParser vdataparser)
- Create the MapReduce Job with GraphParser, and vertex FieldParsers.
- Parameters:
graphparser
- vidparser
- vdataparser
-
run
public void run(java.lang.String inputpath,
java.lang.String outputpath)
throws java.io.IOException
- Parameters:
inputpath
- the path to a unique vertex list. Each line is parsed into (vid,
data) using vidparser
and vdataparser
.outputpath
- the path of output directory.
- Throws:
java.io.IOException