com.ebay.erl.mobius.core.builder
Class SeqFileDataset

java.lang.Object
  extended by com.ebay.erl.mobius.core.builder.Dataset
      extended by com.ebay.erl.mobius.core.builder.SeqFileDataset
All Implemented Interfaces:
java.io.Serializable

public class SeqFileDataset
extends Dataset

Represents a dataset backed with Hadoop sequence file format.

This product is licensed under the Apache License, Version 2.0, available at http://www.apache.org/licenses/LICENSE-2.0. This product contains portions derived from Apache hadoop which is licensed under the Apache License, Version 2.0, available at http://hadoop.apache.org. © 2007 – 2012 eBay Inc., Evan Chiu, Woody Zhou, Neel Sundaresan

See Also:
SeqFileDatasetBuilder, Serialized Form

Field Summary
 
Fields inherited from class com.ebay.erl.mobius.core.builder.Dataset
computedColumns, conf, input_format, job, mapper, name, schema, tupleConstraint
 
Constructor Summary
protected SeqFileDataset(MobiusJob job, java.lang.String name, java.lang.Class<? extends SequenceFileMapper> mapperClass)
          Creating an instance of SeqFileDataset with SequenceFileInputFormat as its input format.
 
Method Summary
protected  void setInputFormat(java.lang.Class<? extends org.apache.hadoop.mapred.InputFormat> input_format)
          Always thrown UnsupportedOperationException, the input format for this dataset is fixed to SequenceFileInputFormat.
protected  void setMapper(java.lang.Class<? extends AbstractMobiusMapper> mapper)
          Set the AbstractMobiusMapper for this dataset.
 
Methods inherited from class com.ebay.erl.mobius.core.builder.Dataset
addComputedColumn, createJobConf, equals, getDatasetID, getInputFormat, getInputs, getMapper, getName, getSchema, hashCode, initialize, orderBy, orderBy, orderBy, orderBy, setSchema, toString, validate, withinSchema
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 

Constructor Detail

SeqFileDataset

protected SeqFileDataset(MobiusJob job,
                         java.lang.String name,
                         java.lang.Class<? extends SequenceFileMapper> mapperClass)
Creating an instance of SeqFileDataset with SequenceFileInputFormat as its input format.

User should not create this dataset directly through this constructor, but to use SeqFileDatasetBuilder to build one.

Parameters:
job - a Mobius job contains an analysis flow.
name - name of this dataset
mapperClass - the mapper class to parse the underline values into Tuples.
Method Detail

setInputFormat

protected void setInputFormat(java.lang.Class<? extends org.apache.hadoop.mapred.InputFormat> input_format)
Always thrown UnsupportedOperationException, the input format for this dataset is fixed to SequenceFileInputFormat.

Overrides:
setInputFormat in class Dataset

setMapper

protected void setMapper(java.lang.Class<? extends AbstractMobiusMapper> mapper)
Set the AbstractMobiusMapper for this dataset.

This method is called by the corresponding implementation of AbstractDatasetBuilder.

mapper must inherent SequenceFileMapper.

Overrides:
setMapper in class Dataset