Fuenfgeld / TeamDataScDatenmanagementUndArchivierung

MIT License

Selecting pandas DataFrame Rows Based On Conditions #9

Closed ahodelin closed 3 years ago

ahodelin commented 3 years ago

Import modules

import pandas as pd
import numpy as np

Create a dataframe (this step is not very relevant, since we read the information from the database or from CSV files)

raw_data = {'first_name': ['Jason', 'Molly', np.nan, np.nan, np.nan],
            'nationality': ['USA', 'USA', 'France', 'UK', 'UK'],
            'age': [42, 52, 36, 24, 70]}
df = pd.DataFrame(raw_data, columns=['first_name', 'nationality', 'age'])
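Since the real data is expected to come from a database or CSV files, the following is a minimal sketch of loading the same table with `pd.read_csv`. The CSV text and the name `df_from_csv` are assumptions made for illustration; a real project would pass a file path instead of the in-memory string.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a real file.
# Empty fields (the missing first names) are read as NaN.
csv_text = """first_name,nationality,age
Jason,USA,42
Molly,USA,52
,France,36
,UK,24
,UK,70
"""

# read_csv accepts a path or a file-like object; StringIO keeps the sketch self-contained.
df_from_csv = pd.read_csv(io.StringIO(csv_text))
print(df_from_csv)
```

The resulting dataframe has the same columns as the hand-built one, so the selections in this issue should apply to it unchanged.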

Select all cases where the first name is not missing and nationality is USA

df[df['first_name'].notnull() & (df['nationality'] == "USA")] # the column names go inside the square brackets
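To make the selection easier to follow, here is a sketch that splits the one-liner into its two conditions. The variable names `has_name`, `is_usa`, `mask` and `selected` are illustrative and not part of the original issue.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    'first_name': ['Jason', 'Molly', np.nan, np.nan, np.nan],
    'nationality': ['USA', 'USA', 'France', 'UK', 'UK'],
    'age': [42, 52, 36, 24, 70],
})

# Each condition is a boolean Series with one True/False value per row.
has_name = df['first_name'].notnull()
is_usa = df['nationality'] == "USA"

# `&` combines the two Series element-wise. When a comparison is written
# inline it needs parentheses, because `&` binds more tightly than `==`.
mask = has_name & is_usa

# Indexing with the mask keeps only the rows where it is True.
selected = df[mask]
print(mask.tolist())
print(selected)
```

Naming the intermediate masks is optional, but it can help when more than two conditions are combined.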

Save the above result in another dataframe

new_df = df[df['first_name'].notnull() & (df['nationality'] == "USA")]
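If the saved result will be modified later, pandas may emit a `SettingWithCopyWarning`, depending on the version. One common way to make the independence from `df` explicit is to append `.copy()`. This is a sketch under that assumption, and the column `age_next_year` is hypothetical.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    'first_name': ['Jason', 'Molly', np.nan, np.nan, np.nan],
    'nationality': ['USA', 'USA', 'France', 'UK', 'UK'],
    'age': [42, 52, 36, 24, 70],
})

# .copy() makes new_df an independent dataframe rather than a selection of df.
new_df = df[df['first_name'].notnull() & (df['nationality'] == "USA")].copy()

# Hypothetical follow-up change: it affects new_df only, df stays untouched.
new_df['age_next_year'] = new_df['age'] + 1
print(new_df)
```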

Return new dataframe

new_df