Friday, December 18, 2020

Major League Soccer 2019 Statistics

Data analytics in soccer is a growing and thrilling field. In this exercise, I used Python to scrape data about the 2019 MLS campaign from fbref.com. Then, I built a table with filters and radar charts to compare the performances of different players. A radar chart is a common tool in the professionals' daily work. For example, when it comes to scouting. 

The code: https://github.com/chrideki/MLS_2019

The Tableau dashboard: https://tabsoft.co/2UuOW0J




Monday, October 19, 2020

U.S. Presidential Debate

Tuesday the 29th, took place the first 2020 presidential debate between President Donald Trump and the former Vice President Joe Biden. As known, it's been an extremely chaotic and confusing debate, rich in interruptions and crosstalking. Nevertheless, this dashboard analyzes some patterns and characteristics of the candidates' speeches. For the data analysis I used the full transcript available on rev.com. Click here to see the dashboard and enjoy the Tableau interactions. 





Monday, July 27, 2020

U.S. Unemployment Rate

As known, unemployment is a big issue these days. I wanted to take a look at the data by State to see how things are doing across the country.
In this exercise, I made use of the API from the U.S. Bureau of Labor Statistics to get data about the unemployment rate in each State. The Python code I used to pull and manipulate data is available on github.
I'm so happy I learned how to create a tile grid map in Tableau, a very handful graph when it comes for example of political data. The Tableau file is available here.

Thursday, February 27, 2020

Census, the Low Response Score in NYC

The Census Bureau developed the Low Response Score (LRS) as a metric to identify the hard-to-survey areas. The higher the LRS value, the lower is the propensity to respond in the census. The following map represents the quantiles of the LRS, breakdown by tracts, in the five boroughs of New York. The darker tracts are the areas where is more difficult to get responses in the census. To create this map I made use of QGIS.

data source: census.gov


Thursday, February 13, 2020

date_part #8 of #100PostgreSQLFunctions

This post is a part of the project #100PostgreSQLFunctions, which aims to explain the meaning of 100 PostgreSQL functions and how to use them. For the full list of functions click here.

date_part is a function to extract subfields of a value, which has to be one of the following datatypes: date, timestamp or interval.
This function has two argument:
  • subfield is the part of the value we want to extract
  • source represents the value from which we want to extract the subfield

Examples:

SELECT
    date_part('month', '2/20/2019'::date);

 date_part
-----------
         2
(1 row)


SELECT
    date_part('second', '2/20/2019 12:5:34'::timestamp);

 date_part
-----------
        34
(1 row)


SELECT
    date_part('day', '6 years 5 months 4 days'::interval);

 date_part
-----------
         4
(1 row)

date_trunc #7 of #100PostgreSQLFunctions

This post is a part of the project #100PostgreSQLFunctions, which aims to explain the meaning of 100 PostgreSQL functions and how to use them. For the full list of functions click here.

date_trunc is a function to truncate a timestamp value, or an interval value, with a level of precision.
This function has two argument:
  • datapart is the level of precision
  • field is the timestamp value or the interval value to truncate

Examples:

SELECT
    date_trunc('month', '1/30/2020'::timestamp);

     date_trunc   
---------------------
 2020-01-01 00:00:00
(1 row)


SELECT
    date_trunc('month', '6 years 3 months 30 days'::interval);

   date_trunc 
----------------
 6 years 3 mons
(1 row)

EXTRACT #6 of #100PostgreSQLFunctions

This post is a part of the project #100PostgreSQLFunctions, which aims to explain the meaning of 100 PostgreSQL functions and how to use them. For the full list of functions click here.

EXTRACT is a function to retrieve subfields of a value, which has to be one of the following datatypes: date, timestamp or interval. By parenthesis, we need to specify the subfield to extract, and from which value we want to get it.

Examples:

SELECT
    EXTRACT(month from '2/20/2019'::date);

 date_part
-----------
         2
(1 row)


SELECT
    EXTRACT(second from '2/20/2019 12:5:34'::timestamp);

 date_part
-----------
        34
(1 row)


SELECT
    EXTRACT(day from '6 years 5 months 4 days'::interval);

 date_part
-----------
         4
(1 row)

Friday, January 24, 2020

GREATEST and LEAST #4 and #5 of #100PostgreSQLFunctions

This post is a part of the project #100PostgreSQLFunctions, which aims to explain the meaning of 100 PostgreSQL functions and how to use them. For the full list of functions click here.

GREATEST is a function to select the biggest value in a list. This function can have two or more arguments, each of the list values. The values in the list should be of the same datatype. If the datatypes are different, all values will be converted to the first argument datatype. NULL values in the list are ignored. If the values are strings, this function sorts the strings in alphabetical order and then selects the last one.

Examples:

SELECT
    GREATEST(1, 23, 45);

 greatest
----------
       45
(1 row)


SELECT
    GREATEST('Joe', 'Alex', 'Cory');

 greatest
----------
 Joe
(1 row)


SELECT
    GREATEST('3/3/2019'::date, '3/11/2019'::date, '3/5/2019'::date);

  greatest
------------
 2019-03-11
(1 row)


LEAST is a function to select the smallest value in a list. This function can have two or more arguments, each of the list values. The values in the list should be of the same datatype. If the datatypes are different, all values will be converted to the first argument datatype. NULL values in the list are ignored. If the values are strings, this function sorts the strings in alphabetical order and then selects the first one.

Examples:

SELECT
    LEAST(1, 23, 45);

 least
-------
     1
(1 row)


SELECT
    LEAST('Joe', 'Alex', 'Cory');

 least
-------
 Alex
(1 row)


SELECT
    LEAST('3/3/2019'::date, '3/11/2019'::date, '3/5/2019'::date);

   least 
------------
 2019-03-03
(1 row)

#100PostgreSQLFunctions

This is the project #100PostgreSQLFunctions, which aims to explain the meaning of 100 PostgreSQL functions and how to use them.

Here is the full list of functions:

1. generate_series
2. CONCAT
3. CONCAT_WS
4. GREATEST
5. LEAST
6. EXTRACT
7. date_trunc
8. date_part


CONCAT and CONCAT_WS #2 and #3 of #100PostgreSQLFunctions

This post is a part of the project #100PostgreSQLFunctions, which aims to explain the meaning of 100 PostgreSQL functions and how to use them. For the full list of functions click here.

CONCAT is a function to concatenate two or more strings into one.
This function can have two or more arguments, each of the strings to merge.

Examples:

SELECT
    CONCAT('Hello', 'world!');

    concat 
--------------
 Helloworld!
(1 row)


SELECT
    CONCAT('Hello', ' ', 'world!');

    concat 
--------------
 Hello world!
(1 row)


CONCAT_WS is a function to concatenate two or more strings with a separator. This function can have three or more arguments:
  • separator is the string to between the others, it could be a space or whatever; 
  • string1 is the first string to merge
  • string2 is the other string to merge
  • string_n is the third or more string to merge;

Example:

SELECT
    CONCAT_WS(' ', 'Hello', 'world!');

    concat 
--------------
 Hello world!
(1 row)