dwyl / learn-postgresql

🐘 Learn how to use PostgreSQL and Structured Query Language (SQL) to store and query your relational data. 🔍
212 stars 23 forks source link
# Learn PostgreSQL Learn how to use PostgreSQL and Structured Query Language (SQL) to store and query your data.

[![Build Status](https://img.shields.io/travis/dwyl/learn-postgresql/master.svg?style=flat-square)](https://travis-ci.org/dwyl/learn-postgresql) [![codecov.io](https://img.shields.io/codecov/c/github/dwyl/learn-postgresql/master.svg?style=flat-square)](https://codecov.io/github/dwyl/learn-postgresql?branch=master) [![Dependencies: None!](https://david-dm.org/dwyl/learn-postgresql/status.svg?style=flat-square)](https://david-dm.org/dwyl/learn-postgresql) [![devDependencies Status](https://david-dm.org/dwyl/learn-postgresql/dev-status.svg?style=flat-square)](https://david-dm.org/dwyl/learn-postgresql?type=dev) [![contributions welcome](https://img.shields.io/badge/contributions-welcome-brightgreen.svg?style=flat-square)](https://github.com/dwyl/learn-postgresql/issues)

Why?

Helping people store, retrieve and derive insights from data is the essence of all software applications.

SQL is Everywhere

Like it or not, Relational Databases store most of the world's structured data and Structured Query Language (SQL) is by far the most frequent way of retrieving the data.

According to the most recent surveys/statistics, SQL still dominates the world of databases.

https://insights.stackoverflow.com/survey/2018/#technology-databases stackoverflow-survey-2018-databases

https://db-engines.com/en/ranking dbms-ranking

Note: you should never adopt a technology based on it's current popularity, also be ware of "argumentum ad populum" ("it's popular therefore you should use it"). Always pick the appropriate tool for the job based on the requirements, constraints and/or availability (both of "skill" on your existing team or in the wider community). We include these stats to explain that relational databases are still the most widely used by far and so learning SQL skills is a very wise investment both as an individual and for your team or organisation.

PostgreSQL is Easy to Learn and it Runs Everywhere!

Getting started with PostgreSQL is easy, (just follow the steps in this guide and try out the example queries!)
When you are ready to deploy your app, you are in safe hands, PostgreSQL runs everywhere:

Who?

Everyone building any application that stores data should learn SQL. SQL is ubiquitous in every field/industry and the sooner you learn/master it, the higher your life-time return on time investment.

Learning how to use a relational database is a foundational skill for all of computer science and application development.

Being proficient in SQL will open the door to Data Science with SQL-on-Hadoop Apache Spark, Google BigQuery, Oracle and Teradata. In short, get really good at SQL! It's very useful.

What?

This tutorial covers 5 areas:

  1. What is PostgreSQL?
  2. How do I get started with PostgreSQL? (a fully functioning example!)
  3. What is Structured Query Language (SQL)? (lots of example queries!)
  4. How do I write my own SQL Queries?
  5. How do I deploy my own PostgreSQL-based Application?

Once you have covered these areas, you will know if PostgreSQL is "right" for your needs, or if you need to keep looking for a different way to store data.

Let's dive in!

1. What is PostgreSQL?

PostgreSQL (often shortened to simply "Postgres") is an advanced Relational DataBase Management System ("RDBMS"), that lets you efficiently and securely store any type of data. We will explain "Relational Database" in the context of our example below, so don't worry if it sounds like a buzzword soup.

Postgres has an emphasis on standards compliance and extensibility which means there are many plugins you can use to enhance it like PostGIS for mapping applications and entire projects built on top of it like TimescaleDB (a time-series database perfect for analytics) and AgensGraph (a graph database, great for modelling networks e.g a "social graph").

Structured Query Language (SQL) is the preferred means of interacting with data at any scale.

The only reason MySQL is still more widely used than Postgres can be summarised in one word: WordPress. WordPress has a firm grip on the CMS-based website market and it shows no sign of slowing down. If your goal is to build CMS-based websites, or the company you already work for uses WordPress, you should go for it! If you prefer a more general introduction to SQL, follow this tutorial! The knowledge you will gain by learning Postgres is 95%+ "transferable" to other SQL databases so don't worry about the differences between MySQL and Postgres for now. If you're curious, read: https://hackr.io/blog/postgresql-vs-mysql

How?

Installation

Before you get started with using PostgreSQL, you'll have to install it. Follow these steps to get started:

MacOS

  1. There are a couple of ways to install PostgreSQL. One of the easier ways to get started is with Postgres.app. Navigate to https://postgresapp.com/ and then click "Download": download

  2. Once it's finished downloading, double click on the file to unzip then move the PostgreSQL elephant icon into your applications folder. Double click the icon to launch the application.

  3. You should now see a new window launched with a list of servers to the left side of the window (if it's a fresh install, you should see one named PostgreSQL XX). If it shows anything else or an error props up, make sure you don't have any other instances of Postgres on your computer and reinstall. To fully reinstall follow these steps to delete data directories and preferences. Click on the button 'Initialize' (or 'Start' if you had already installed previously).

    download
  4. Run sudo mkdir -p /etc/paths.d && echo /Applications/Postgres.app/Contents/Versions/latest/bin | sudo tee /etc/paths.d/postgresapp (found here) to use psql in the terminal. Close and open the terminal.

  5. Postgres.app will by default create a role and database that matches your current macOS username. You can connect straight away by running psql.

  6. You should then see something in your terminal that looks like this (with your macOS username in front of the prompt rather than 'postgres'):

terminal

  1. You should now be all set up to start using PostgreSQL. For documentation on command line tools etc see https://postgresapp.com/documentation/

Ubuntu

Digital Ocean have got a great article on getting started with postgres. A quick summary is below.

Installation
sudo apt-get update
sudo apt-get install postgresql postgresql-contrib

By default the only role created is the default 'postgres', so PostgreSQL will only respond to connections from an Ubuntu user called 'postgres'. We need to pretend to be that user and create a role matching our actual Ubuntu username:

sudo -u postgres createuser --interactive

This command means 'run the command createuser --interactive as the user called "postgres"'.

When asked for the name of the role enter your Ubuntu username. If you're not sure, open a new Terminal tab and run whoami.

When asked if you want to make the role a superuser, type 'y'.

We now need to create the database matching the role name, as PostgreSQL expects this. Run:

sudo -u postgres createdb [your user name]

You can now connect to PostgreSQL by running psql.

Create your first PostgreSQL database

  1. To start PostgreSQL, type this command into the terminal:
    psql

  2. Next type this command into the PostgreSQL interface:
    CREATE DATABASE test;
    NOTE: Don't forget the semi-colon. If you do, useful error messages won't show up.

  3. To check that our database has been created, type \l into the psql prompt. You should see something like this in your terminal: test db

Create new users for your database

  1. If you closed the PostgreSQL server, start it again with:
    psql

  2. To create a new user, type the following into the psql prompt:

    CREATE USER testuser;
  3. Check that your user has been created. Type \du into the prompt. You should see something like this: user Users can be given certain permissions to access any given database you have created.

  4. Next we need to give our user permissions to access the test database we created above. Enter the following command into the psql prompt:

    GRANT ALL PRIVILEGES ON DATABASE test TO testuser;

PostGIS - Spacial and Geographic objects for PostgreSQL

PostGIS Installation

If you've installed Postgres App as in the example above, you can easily extend it to include PostGIS. Follow these steps to begin using PostGIS:

  1. Ensure that you're logged in as a user OTHER THAN postgres. Follow the steps above to enable your default user to be able to access the psql prompt. (installation step 7)

  2. Type the following into the psql prompt to add the extension:
    CREATE EXTENSION postgis;

PostGIS Distance between two sets of coordinates

After you've extended PostgreSQL with PostGIS you can begin to use it. Type the following command into the psql command line:

SELECT ST_Distance(gg1, gg2) As spheroid_dist
FROM (SELECT
    ST_GeogFromText('SRID=4326;POINT(-72.1235 42.3521)') As gg1,
    ST_GeogFromText('SRID=4326;POINT(-72.1235 43.1111)') As gg2
    ) As foo  ;

This should return spheroid_dist along with a value in meters. The example above returns: 84315.42034614 which is rougly 84.3km between the two points.

Commands

Once you are serving the database from your computer

Postgresql follows the SQL convention of calling relations TABLES, attributes COLUMNs and tuples ROWS

Transaction All or nothing, if something fails the other commands are rolled back like nothing happened

Reference When a table is being created you can reference a column in another table to make sure any value which is added to that column exists in the referenced table.

CREATE TABLE cities (
  name text NOT NULL,
  postal_code varchar(9) CHECK (postal_code <> ''),
  country_code char(2) REFERENCES countries,
  PRIMARY KEY (country_code, postal_code)
);

<> means not equal

Join reads You can join tables together when reading them,

Inner Join Joins together two tables by specifying a column in each to join them by i.e.

SELECT cities.*, country_name
  FROM cities INNER JOIN countries
  ON cities.country_code = countries.country_code;

This will select all of the columns in both the countries and cities tables the data, the rows are matched up by country_code.

Grouping You can put rows into groups where the group is defined by a shared value in a particular column.

SELECT venue_id, count(*)
  FROM events
  GROUP BY venue_id;

This will group the rows together by the venue_id, count is then performed on each of the groups.

Learning Resources