In this lesson, we'll learn how to:
Pandas provides powerful ways to filter data using boolean conditions. You can:
import pandas as pd
# Sample DataFrame
df = pd.DataFrame({
'name': ['John', 'Anna', 'Peter'],
'age': [28, 22, 35],
'city': ['New York', 'Paris', 'London']
})
# Filter by condition
young_people = df[df['age'] < 30]
# Multiple conditions
young_ny = df[(df['age'] < 30) & (df['city'] == 'New York')]
Pandas is a powerful data manipulation library in Python that provides various functions for working with dates and times. When working with time-series data, it is essential to be able to manipulate and extract information from dates efficiently. Pandas provides a range of date functions that make it easy to work with dates and times in your data.
Some common date functions in pandas include: - pd.to_datetime(): Convert strings or numbers to datetime objects. - dt.date(): Extract the date component of a datetime object. - dt.time(): Extract the time component of a datetime object. - dt.day: Extract the day of the month from a datetime object. - dt.month: Extract the month from a datetime object. - dt.year: Extract the year from a datetime object. - dt.weekday(): Extract the day of the week as an integer, where Monday is 0 and Sunday is 6.
By using these date functions in pandas, you can easily perform operations like filtering data by date, extracting specific components of a date, or creating new date columns based on existing dates.
```python
import pandas as pd
# Create a sample DataFrame with dates
data = {'date': ['2022-01-01', '2022-01-02', '2022-01-03']}
df = pd.DataFrame(data)
# Convert the 'date' column to datetime
df['date'] = pd.to_datetime(df['date'])
# Extract the day, month, and year from the 'date' column
df['day'] = df['date'].dt.day
df['month'] = df['date'].dt.month
df['year'] = df['date'].dt.year
print(df)
```
Output:
```
date day month year
0 2022-01-01 1 1 2022
1 2022-01-02 2 1 2022
2 2022-01-03 3 1 2022
```