Merge request template: please remove the appropriate parts of this template.
Pre-merge request checklist (to be completed by the one making the request):
[x] I have performed a full review of this code myself.
For Python code in PySpark-specific sections, all code should have been run in Jupyter notebooks.
For code in sections of the book containing both Python and R code, the page of the book should be constructed as described in the contributing guide and converted to a markdown file.
[x] I have formatted the outputs of code blocks correctly (to match other outputs in the book and in line with the style guide [coming soon])
[x] I have built the book as outlined in the contributing guide and confirmed that any additional/modified content is displaying as expected.
Details of this request:
Adds sparklyr code for spark_read_avro() and spark_write_avro(), with an accompanying explanation, to the reading and writing Avro files section of the book.
Closes issue #83
Things to note about this request:
You will need to install the sparkavro R package (via install.packages("sparkavro")) before you can run the code.
Requirements for review (such as):
[ ] Check that Avro files can be read and written correctly using the sparklyr code
[ ] Check that the explanations given make sense and are complete
[ ] Check that the link to the sparkavro R package on CRAN works correctly