com.intel.hadoop.graphbuilder.demoapps.wikipedia.docwordgraph
Class WordCountGraphTokenizer

java.lang.Object
  extended by com.intel.hadoop.graphbuilder.demoapps.wikipedia.docwordgraph.WordCountGraphTokenizer
All Implemented Interfaces:
GraphTokenizer<StringType,StringType,StringType>

public class WordCountGraphTokenizer
extends java.lang.Object
implements GraphTokenizer<StringType,StringType,StringType>


Constructor Summary
WordCountGraphTokenizer()
           
 
Method Summary
 void configure(org.apache.hadoop.mapred.JobConf job)
          Configure the tokenizer from JobConf.
 java.lang.Class edataClass()
           
 java.util.Iterator<Edge<StringType,StringType>> getEdges()
           
 java.util.Iterator<Vertex<StringType,StringType>> getVertices()
           
 void parse(java.lang.String s)
          Parse the input string and filter into internal vertex and edge fields.
 java.lang.Class vdataClass()
           
 java.lang.Class vidClass()
           
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

WordCountGraphTokenizer

public WordCountGraphTokenizer()
Method Detail

configure

public void configure(org.apache.hadoop.mapred.JobConf job)
Description copied from interface: GraphTokenizer
Configure the tokenizer from JobConf.

Specified by:
configure in interface GraphTokenizer<StringType,StringType,StringType>

vidClass

public java.lang.Class vidClass()
Specified by:
vidClass in interface GraphTokenizer<StringType,StringType,StringType>
Returns:
Class of the VidType. Used for type safety in the high level.

vdataClass

public java.lang.Class vdataClass()
Specified by:
vdataClass in interface GraphTokenizer<StringType,StringType,StringType>
Returns:
Class of the VertexData. Used for type safety in the high level.

edataClass

public java.lang.Class edataClass()
Specified by:
edataClass in interface GraphTokenizer<StringType,StringType,StringType>
Returns:
Class of the EdgeData. Used for type safety in the high level.

parse

public void parse(java.lang.String s)
Description copied from interface: GraphTokenizer
Parse the input string and filter into internal vertex and edge fields.

Specified by:
parse in interface GraphTokenizer<StringType,StringType,StringType>

getVertices

public java.util.Iterator<Vertex<StringType,StringType>> getVertices()
Specified by:
getVertices in interface GraphTokenizer<StringType,StringType,StringType>
Returns:
a list of Vertex extracted from the input.

getEdges

public java.util.Iterator<Edge<StringType,StringType>> getEdges()
Specified by:
getEdges in interface GraphTokenizer<StringType,StringType,StringType>
Returns:
a list of Edge extracted from the input.