UBC-DSCI / introduction-to-datascience

Open Source Textbook for DSCI100: Introduction to Data Science in R
https://datasciencebook.ca/
Other
50 stars 54 forks source link

minor wording chapter 2 #459

Closed trevorcampbell closed 1 year ago

trevorcampbell commented 1 year ago

Notice anything? They look the same! The same number of columns/rows and column names! So we needed to use different tools for the job depending on the file format and our resulting table (canlang_data) in both cases was the same!

This is a bit awkward -- as we are working through the python book, I rephrased this:

Notice anything? They look the same; they have the same number of columns and rows, and have the same column names! So even though we needed to use different arguments depending on the file format, our resulting data frame (canlang_data) in both cases was the same.

trevorcampbell commented 1 year ago

Section 2.4.1 is called

read_csv to read in comma-separated files

it should be

read_csv to read in comma-separated value files

or something like that

Same issue with 2.4.3

trevorcampbell commented 1 year ago

"To read in this type of data, we can use the read_tsv to read in .tsv (tab separated values) files."

To read in blah, we can use read_tsv to read in blah

awk repetition

trevorcampbell commented 1 year ago

Here, we provide the URL to read_* as the path to the file instead of a path to a local file on our computer

should be

path to a remote file instead of...

this sentence is awkward as well, maybe rewrite

trevorcampbell commented 1 year ago

You can preview files in a plain text editor by right-clicking on the file, selecting “Open With,” and choosing a plain text editor (e.g., Notepad).

weird OS-specific instruction here. remove

trevorcampbell commented 1 year ago

add rename to learning objectives?

trevorcampbell commented 1 year ago

at the end:

mands can currently be translated with SQLite databases. For example, we can compute a mean with an SQLite database but can’t easily compute a median. So you might be wondering: why should we use databases at all?

We should rewrite this to refer to general DBs, not just sqlite