com.intel.hadoop.graphbuilder.demoapps.wikipedia.docwordgraph
Class WordCountGraphTokenizer
java.lang.Object
com.intel.hadoop.graphbuilder.demoapps.wikipedia.docwordgraph.WordCountGraphTokenizer
- All Implemented Interfaces:
- GraphTokenizer<StringType,StringType,StringType>
public class WordCountGraphTokenizer
- extends java.lang.Object
- implements GraphTokenizer<StringType,StringType,StringType>
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
WordCountGraphTokenizer
public WordCountGraphTokenizer()
configure
public void configure(org.apache.hadoop.mapred.JobConf job)
- Description copied from interface:
GraphTokenizer
- Configure the tokenizer from JobConf.
- Specified by:
configure
in interface GraphTokenizer<StringType,StringType,StringType>
vidClass
public java.lang.Class vidClass()
- Specified by:
vidClass
in interface GraphTokenizer<StringType,StringType,StringType>
- Returns:
- Class of the VidType. Used for type safety in the high level.
vdataClass
public java.lang.Class vdataClass()
- Specified by:
vdataClass
in interface GraphTokenizer<StringType,StringType,StringType>
- Returns:
- Class of the VertexData. Used for type safety in the high level.
edataClass
public java.lang.Class edataClass()
- Specified by:
edataClass
in interface GraphTokenizer<StringType,StringType,StringType>
- Returns:
- Class of the EdgeData. Used for type safety in the high level.
parse
public void parse(java.lang.String s)
- Description copied from interface:
GraphTokenizer
- Parse the input string and filter into internal vertex and edge fields.
- Specified by:
parse
in interface GraphTokenizer<StringType,StringType,StringType>
getVertices
public java.util.Iterator<Vertex<StringType,StringType>> getVertices()
- Specified by:
getVertices
in interface GraphTokenizer<StringType,StringType,StringType>
- Returns:
- a list of
Vertex
extracted from the input.
getEdges
public java.util.Iterator<Edge<StringType,StringType>> getEdges()
- Specified by:
getEdges
in interface GraphTokenizer<StringType,StringType,StringType>
- Returns:
- a list of
Edge
extracted from the input.