Script 445: SBA inactive campaigns

Purpose

Python script to identify inactive campaigns and calculate their pacing.

To Elaborate

This script identifies inactive campaigns and reports their pacing. It reads the primary data source, then filters and aggregates it according to the following business rules:

Only campaigns whose SBA bucket end date falls in the current month and year are considered.
The script calculates the maximum predicted UV difference for each SBA bucket.
It filters the data to the last 5 days, from 5 days before today through today.
The script groups the data by campaign and SBA bucket, and sums the publication cost.
It keeps only the campaign and bucket pairs whose summed publication cost is equal to 0.
The script gets a distinct list of SBA bucket names from those zero-cost rows.
It restricts the filtered data to rows in those buckets, so every campaign in a bucket with at least one zero-cost campaign is retained.
The script merges the filtered data with the result of the maximum predicted UV difference calculation and the 5-day cost.
It updates the ‘Status’ column based on the publication cost (inactive if cost is 0, active otherwise).
The script updates the ‘Pacing’ column based on the predicted UV difference (on target if difference is greater than 95, not pacing otherwise).
The final output is the working data frame.
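
The Status and Pacing rules above can be sketched in plain Python to make the boundary cases explicit. The helper names classify_status and classify_pacing are hypothetical and do not appear in the script, which applies the same comparisons through lambda expressions. A minimal sketch, assuming numeric inputs:

```python
import math


def classify_status(pub_cost):
    # Hypothetical helper: 'Inactive' only when the summed cost is exactly 0.
    return "Inactive" if pub_cost == 0 else "Active"


def classify_pacing(predicted_uv_dif):
    # Hypothetical helper: strictly greater than 95 counts as on target.
    return "On target" if predicted_uv_dif > 95 else "Not pacing"


# Invented sample values: a zero-cost row, a boundary row, and a missing row.
for cost, uv_dif in [(0.0, 120.0), (15.0, 95.0), (math.nan, math.nan)]:
    print(cost, uv_dif, classify_status(cost), classify_pacing(uv_dif))
```

Because the pacing comparison is strict, a difference of exactly 95 is reported as not pacing. A missing (NaN) cost is reported as active, since NaN never compares equal to 0.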

Walking Through the Code

Define the DEVICE dictionary with device types.
Get the current date, month, and year in the client time zone.
Set the primary data source and column constants.
Create the output column constants.
Print the tableized input data frame.
Convert the ‘SBA Bucket End Date’ and ‘Date’ columns to datetime format.
Filter the input data frame to include only rows whose ‘SBA Bucket End Date’ falls in the current month and year.
Group the filtered data frame by ‘SBA Bucket Name’ and calculate the maximum predicted UV difference.
Rename the columns of the result data frame.
Calculate the date threshold for the last 5 days.
Filter the filtered data frame to include only rows within the date threshold.
Group the filtered data frame by ‘Campaign’ and ‘SBA Bucket Name’ and sum the publication cost.
Filter the resulting data frame to include only rows where the publication cost is equal to 0.
Get a distinct list of ‘SBA Bucket Name’.
Update the filtered data frame to only include rows with ‘SBA Bucket Name’ in the distinct list.
Create a working data frame with the distinct ‘Campaign’ and ‘SBA Bucket Name’ pairs.
Merge the working data frame with the result data frame on ‘SBA Bucket Name’.
Merge the working data frame with the 5-day cost data frame on ‘Campaign’.
Update the ‘Status’ column based on the publication cost (inactive if cost is 0, active otherwise).
Update the ‘Pacing’ column based on the predicted UV difference (on target if difference is greater than 95, not pacing otherwise).
Set the output data frame as the working data frame.
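
The 5-day window and zero-cost steps can be tried on a small invented data set. The column names below match the script; the dates, costs, and the fixed value of today are assumptions made for illustration only.

```python
import datetime

import pandas as pd

# Assumed "today" so the example is reproducible; the script uses the client clock.
today = pd.Timestamp("2023-10-26")

# Invented rows: campaigns A and B share a bucket, campaign C is in another.
df = pd.DataFrame({
    "Campaign": ["A", "A", "B", "B", "C"],
    "SBA Bucket Name": ["Bucket 1", "Bucket 1", "Bucket 1", "Bucket 1", "Bucket 2"],
    "Date": pd.to_datetime(
        ["2023-10-24", "2023-10-25", "2023-10-24", "2023-10-10", "2023-10-25"]
    ),
    "Pub. Cost $": [0.0, 0.0, 12.5, 3.0, 7.0],
})

# Keep rows dated from 5 days before today through today (both ends inclusive).
start_date = today - datetime.timedelta(days=5)
window = df[(df["Date"] >= start_date) & (df["Date"] <= today)]

# Sum cost per campaign and bucket, then find buckets with a zero-cost campaign.
cost_5d = (
    window.groupby(["Campaign", "SBA Bucket Name"])["Pub. Cost $"].sum().reset_index()
)
zero_cost = cost_5d[cost_5d["Pub. Cost $"] == 0]
inactive_buckets = zero_cost["SBA Bucket Name"].unique()

print(cost_5d)
print(inactive_buckets)
```

Here campaign A has no spend in the window, so ‘Bucket 1’ is flagged, and campaign B stays in scope because it shares that bucket. The window is inclusive at both ends, so it covers today and the five preceding days.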

Vitals

Script ID : 445
Client ID / Customer ID: 1306923673 / 60269245
Action Type: Email Report
Item Changed: None
Output Columns:
Linked Datasource: M1 Report
Reference Datasource: None
Owner: Jonathan Reichl (jreichl@marinsoftware.com)
Created by Jonathan Reichl on 2023-10-26 13:36
Last Updated by Stephen Malina on 2023-12-11 15:41

> See it in Action

Python Code

##
## name: 
## description:
##  
## 
## author: undefined
## created: 2023-10-26
## 

DEVICE = {
  'MOBILE': 'm',
  'DESKTOP': 'c',
  'TABLET': 't',
}
today = datetime.datetime.now(CLIENT_TIMEZONE).date()
current_month = today.month
current_year = today.year

# primary data source and columns
inputDf = dataSourceDict["1"]
RPT_COL_CAMPAIGN = 'Campaign'
RPT_COL_DATE = 'Date'
RPT_COL_SBA_BUCKET_NAME = 'SBA Bucket Name'
RPT_COL_PUB_COST = 'Pub. Cost $'
RPT_COL_SBA_PREDICTED_UV_DIF = 'SBA Predicted UV Dif'
RPT_COL_SBA_PREDICTED = 'SBA Predicted UVs'
RPT_COL_CAMPAIGN_STATUS = 'Campaign Status'
RPT_COL_SBA_BUCKET_END_DATE = 'SBA Bucket End Date'
RPT_COL_SBA_MODEL_TARGET = 'SBA Budget Model Target'

# output columns and initial values
BULK_COL_ACCOUNT = 'Account'

#outputDf[BULK_COL_SOCIAL_PLAYPAUSE_UPDATE_STATUS] = "<<YOUR VALUE>>"

# user code start here
print(tableize(inputDf))
inputDf[RPT_COL_SBA_BUCKET_END_DATE] = pd.to_datetime(inputDf[RPT_COL_SBA_BUCKET_END_DATE])
inputDf[RPT_COL_DATE] = pd.to_datetime(inputDf[RPT_COL_DATE])

# only rows whose SBA bucket end date falls in the current month and year
filtered_df = inputDf[(inputDf[RPT_COL_SBA_BUCKET_END_DATE].dt.month == current_month) & (inputDf[RPT_COL_SBA_BUCKET_END_DATE].dt.year == current_year)]

# max predicted UV difference per SBA bucket, computed for all buckets in filtered_df
result_df = filtered_df.groupby(RPT_COL_SBA_BUCKET_NAME)[RPT_COL_SBA_PREDICTED_UV_DIF].max().reset_index()
# Set the column names explicitly after reset_index() (they keep their original names)
result_df.columns = [RPT_COL_SBA_BUCKET_NAME, RPT_COL_SBA_PREDICTED_UV_DIF]

## Calculate the date threshold (last 5 days)
start_date = pd.to_datetime(today - datetime.timedelta(days=5))
today = pd.to_datetime(today)

five_day_df = filtered_df[(filtered_df[RPT_COL_DATE] >= start_date) & (filtered_df[RPT_COL_DATE] <= today)]

# Group by 'RPT_COL_CAMPAIGN' and 'RPT_COL_SBA_BUCKET_NAME' and sum 'RPT_COL_PUB_COST'
five_day_df = five_day_df.groupby([RPT_COL_CAMPAIGN, RPT_COL_SBA_BUCKET_NAME])[RPT_COL_PUB_COST].sum().reset_index()

# Filter the DataFrame to include rows where 'RPT_COL_PUB_COST' is equal to 0
five_day_df_filter = five_day_df[five_day_df[RPT_COL_PUB_COST] == 0]

# Get a distinct list of 'RPT_COL_SBA_BUCKET_NAME'
distinct_buckets = five_day_df_filter[RPT_COL_SBA_BUCKET_NAME].unique()

# Update filtered_df to only include rows with 'RPT_COL_SBA_BUCKET_NAME' in distinct_buckets
filtered_df = filtered_df[filtered_df[RPT_COL_SBA_BUCKET_NAME].isin(distinct_buckets)]

# One row per distinct campaign and bucket pair
workingdf = filtered_df[[RPT_COL_CAMPAIGN, RPT_COL_SBA_BUCKET_NAME]].drop_duplicates()

## add predicted target 
workingdf = workingdf.merge(result_df, on=RPT_COL_SBA_BUCKET_NAME, how='left')


## add 5 day cost 
workingdf = workingdf.merge(five_day_df[[RPT_COL_CAMPAIGN, RPT_COL_PUB_COST]], on=RPT_COL_CAMPAIGN, how='left')

# Update 'Status' column based on 'RPT_COL_PUB_COST'
workingdf['Status'] = workingdf[RPT_COL_PUB_COST].apply(lambda x: 'Inactive' if x == 0 else 'Active')

# Update 'Pacing' column based on 'RPT_COL_SBA_PREDICTED_UV_DIF'
workingdf['Pacing'] = workingdf[RPT_COL_SBA_PREDICTED_UV_DIF].apply(lambda x: 'On target' if x > 95 else 'Not pacing')


outputDf = workingdf

#outputDf = outputDf.sort_values(by=RPT_COL_SBA_BUCKET_NAME)
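
One point worth checking in the script above is the 5-day cost merge, which joins on ‘Campaign’ alone. If a campaign can appear under more than one SBA bucket (an assumption; the source data may never contain this), that join would repeat rows. The sketch below, using invented data, compares it with a join on both keys:

```python
import pandas as pd

# Invented data: campaign A appears under two buckets (an assumed scenario).
working = pd.DataFrame({
    "Campaign": ["A", "A"],
    "SBA Bucket Name": ["Bucket 1", "Bucket 2"],
})
cost_5d = pd.DataFrame({
    "Campaign": ["A", "A"],
    "SBA Bucket Name": ["Bucket 1", "Bucket 2"],
    "Pub. Cost $": [0.0, 40.0],
})

# Join on campaign only, as the script does: each working row matches both cost rows.
by_campaign = working.merge(
    cost_5d[["Campaign", "Pub. Cost $"]], on="Campaign", how="left"
)

# Join on campaign and bucket: each working row matches exactly one cost row.
by_both = working.merge(cost_5d, on=["Campaign", "SBA Bucket Name"], how="left")

print(len(by_campaign), len(by_both))
```

The first join yields four rows and the second yields two. Whether the two-key join is preferable depends on how campaigns map to buckets in the M1 Report.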

Post generated on 2024-05-15 07:44:05 GMT

26 Oct 2023
