Calculating Moving Average in Python Using a Pandas DataFrame

author

The moving average is a popular statistical tool used to measure the average value of a set of numbers over a specific time period. It is particularly useful for analyzing financial data, such as stock prices, where it can help identify trends and market fluctuations. In this article, we will learn how to calculate a moving average in Python using the popular data processing library, Pandas.

Pandas Basics

Pandas is a powerful library that provides efficient and convenient access to large datasets. It is widely used in data analysis and data science applications. In this article, we will use the Pandas library to perform data manipulation and analysis.

1. Importing Pandas

First, we need to import the Pandas library into our Python script. If you have not already done so, follow these steps:

```python

import pandas as pd

```

2. Creating a DataFrame

Next, we will create a Pandas DataFrame to store our data. A DataFrame is a table-like data structure that allows us to perform efficient columnar data analysis.

```python

data = {'Date': ['2021-01-01', '2021-01-02', '2021-01-03', '2021-01-04', '2021-01-05'],

'Price': [100, 105, 110, 115, 120]}

df = pd.DataFrame(data)

```

3. Calculating Moving Average

Now, we will create a function to calculate a moving average for our data. We will use a window size of 3, which means we will consider the prices of the previous 3 days when calculating the moving average.

```python

def moving_average(data, window):

window_size = window

sum_price = pd.Series(data['Price']).rolling(window=window_size).sum()

return sum_price

```

4. Calculating Moving Average for Our Data

Finally, we will use our function to calculate the moving average for our data and store the result in a new column in our DataFrame.

```python

df['Moving Average'] = moving_average(df, 3)

```

5. Viewing the Result

Now, we can view our DataFrame to see the calculated moving average for each price in our data.

```python

print(df)

```

Output:

```

Date Price Moving Average

0 2021-01-01 100 100.0

1 2021-01-02 105 103.5

2 2021-01-03 110 106.5

3 2021-01-04 115 110.0

4 2021-01-05 120 114.0

```

In this article, we learned how to calculate a moving average in Python using the Pandas library. The moving average is a useful tool for analyzing data and identifying trends, and our example demonstrates how to perform this calculation in practice. We hope this helps you in your data analysis and data science projects!

coments
Have you got any ideas?