apache spark - Where is the reference for options for writing or reading per format? -
i use spark 1.6.1.
we trying write orc file hdfs using hivecontext , dataframewriter. while can use
df.write().orc(<path>)
we rather like
df.write().options(map("format" -> "orc", "path" -> "/some_path")
this have flexibility change format or root path depending on application uses helper library. can find reference options can passed dataframewriter? found nothing in docs here
where can find reference options can passed dataframewriter?
the definitive , authoritative answer sources:
some description may find in docs, there no single page (that possibly auto-generated sources stay up-to-date most).
the reason being options separated format implementation on purpose have flexibility want offer per use case (as duly noted):
this have flexibility change format or root path depending on application uses helper library.
your question seems similar how know file formats supported databricks? said:
where can list of options supported each file format?
that's not possible there no api follow (like in spark mllib) define options. every format on own...unfortunately , best bet read documentation or (more authoritative) source code.
Comments
Post a Comment