Class

com.twitter.scalding

TemplatedTsv

Related Doc: package scalding

Permalink

case class TemplatedTsv(basePath: String, template: String, pathFields: Fields = Fields.ALL, writeHeader: Boolean = false, sinkMode: SinkMode = SinkMode.REPLACE, fields: Fields = Fields.ALL) extends TemplateSource with DelimitedScheme with Product with Serializable

An implementation of TSV output, split over a template tap.

basePath

The root path for the output.

template

The java formatter style string to use as the template. e.g. %s/%s.

pathFields

The set of fields to apply to the path.

writeHeader

Flag to indicate that the header should be written to the file.

sinkMode

How to handle conflicts with existing output.

fields

The set of fields to apply to the output.

Source
TemplateSource.scala
Linear Supertypes
Serializable, Product, Equals, DelimitedScheme, TemplateSource, HfsTapProvider, SchemedSource, Source, Serializable, AnyRef, Any
Type Hierarchy
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. TemplatedTsv
  2. Serializable
  3. Product
  4. Equals
  5. DelimitedScheme
  6. TemplateSource
  7. HfsTapProvider
  8. SchemedSource
  9. Source
  10. Serializable
  11. AnyRef
  12. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new TemplatedTsv(basePath: String, template: String, pathFields: Fields = Fields.ALL, writeHeader: Boolean = false, sinkMode: SinkMode = SinkMode.REPLACE, fields: Fields = Fields.ALL)

    Permalink

    basePath

    The root path for the output.

    template

    The java formatter style string to use as the template. e.g. %s/%s.

    pathFields

    The set of fields to apply to the path.

    writeHeader

    Flag to indicate that the header should be written to the file.

    sinkMode

    How to handle conflicts with existing output.

    fields

    The set of fields to apply to the output.

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. val basePath: String

    Permalink

    The root path for the output.

    The root path for the output.

    Definition Classes
    TemplatedTsvTemplateSource
  6. def checkFlowDefNotNull()(implicit flowDef: FlowDef, mode: Mode): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Source
  7. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  8. def createHfsTap(scheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _], path: String, sinkMode: SinkMode): Hfs

    Permalink
    Definition Classes
    HfsTapProvider
  9. def createTap(readOrWrite: AccessMode)(implicit mode: Mode): Tap[_, _, _]

    Permalink

    Creates the template tap.

    Creates the template tap.

    readOrWrite

    Describes if this source is being read from or written to.

    mode

    The mode of the job. (implicit)

    returns

    A cascading TemplateTap.

    Definition Classes
    TemplateSourceSource
  10. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  11. val fields: Fields

    Permalink

    The set of fields to apply to the output.

    The set of fields to apply to the output.

    Definition Classes
    TemplatedTsvDelimitedScheme
  12. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  13. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  14. def hdfsScheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _]

    Permalink

    The scheme to use if the source is on hdfs.

    The scheme to use if the source is on hdfs.

    Definition Classes
    DelimitedSchemeSchemedSource
  15. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  16. def localScheme: TextDelimited

    Permalink

    The scheme to use if the source is local.

    The scheme to use if the source is local.

    Definition Classes
    DelimitedSchemeSchemedSource
  17. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  18. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  19. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  20. val pathFields: Fields

    Permalink

    The set of fields to apply to the path.

    The set of fields to apply to the path.

    Definition Classes
    TemplatedTsvTemplateSource
  21. val quote: String

    Permalink
    Definition Classes
    DelimitedScheme
  22. def read(implicit flowDef: FlowDef, mode: Mode): Pipe

    Permalink
    Definition Classes
    Source
  23. val safe: Boolean

    Permalink
    Definition Classes
    DelimitedScheme
  24. val separator: String

    Permalink
    Definition Classes
    DelimitedScheme
  25. val sinkMode: SinkMode

    Permalink

    How to handle conflicts with existing output.

    How to handle conflicts with existing output.

    Definition Classes
    TemplatedTsvSchemedSource
  26. val skipHeader: Boolean

    Permalink
    Definition Classes
    DelimitedScheme
  27. def sourceId: String

    Permalink

    This is a name the refers to this exact instance of the source (put another way, if s1.sourceId == s2.sourceId, the job should work the same if one is replaced with the other

    This is a name the refers to this exact instance of the source (put another way, if s1.sourceId == s2.sourceId, the job should work the same if one is replaced with the other

    Definition Classes
    Source
  28. val strict: Boolean

    Permalink
    Definition Classes
    DelimitedScheme
  29. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  30. val template: String

    Permalink

    The java formatter style string to use as the template.

    The java formatter style string to use as the template. e.g. %s/%s.

    Definition Classes
    TemplatedTsvTemplateSource
  31. def transformForRead(pipe: Pipe): Pipe

    Permalink
    Attributes
    protected
    Definition Classes
    Source
  32. def transformForWrite(pipe: Pipe): Pipe

    Permalink
    Attributes
    protected
    Definition Classes
    Source
  33. def transformInTest: Boolean

    Permalink

    The mock passed in to scalding.JobTest may be considered as a mock of the Tap or the Source.

    The mock passed in to scalding.JobTest may be considered as a mock of the Tap or the Source. By default, as of 0.9.0, it is considered as a Mock of the Source. If you set this to true, the mock in TestMode will be considered to be a mock of the Tap (which must be transformed) and not the Source.

    Definition Classes
    Source
  34. val types: Array[Class[_]]

    Permalink
    Definition Classes
    DelimitedScheme
  35. def validateTaps(mode: Mode): Unit

    Permalink

    Validates the taps, makes sure there are no nulls as the path or template.

    Validates the taps, makes sure there are no nulls as the path or template.

    mode

    The mode of the job.

    Definition Classes
    TemplateSourceSource
  36. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  37. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  38. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  39. def writeFrom(pipe: Pipe)(implicit flowDef: FlowDef, mode: Mode): Pipe

    Permalink

    write the pipe but return the input so it can be chained into the next operation

    write the pipe but return the input so it can be chained into the next operation

    Definition Classes
    Source
  40. val writeHeader: Boolean

    Permalink

    Flag to indicate that the header should be written to the file.

    Flag to indicate that the header should be written to the file.

    Definition Classes
    TemplatedTsvDelimitedScheme

Deprecated Value Members

  1. def readAtSubmitter[T](implicit mode: Mode, conv: TupleConverter[T]): Stream[T]

    Permalink
    Definition Classes
    Source
    Annotations
    @deprecated
    Deprecated

    (Since version 0.9.0) replace with Mappable.toIterator

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from DelimitedScheme

Inherited from TemplateSource

Inherited from HfsTapProvider

Inherited from SchemedSource

Inherited from Source

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped