Script 631: Alert CCT Groups paused (Single word keywords)
Purpose:
The Python script identifies and filters paused CCT groups with single-word keywords based on a specific date range.
To Elaborate
The script is designed to process a dataset containing information about CCT (Structured Budget Allocation) groups and their associated keywords. It focuses on identifying groups that have been paused and filters them based on a specific date range. The script extracts relevant information from a column that contains processing details, such as the pause date and reason. It then converts these extracted dates into a datetime format to facilitate comparison. The primary goal is to filter out records where the pause date is within the last five days, allowing users to focus on recent changes in the status of their CCT groups. This functionality is particularly useful for monitoring and managing keyword campaigns effectively.
Walking Through the Code
- Data Preparation
- The script begins by defining a dictionary
MATCH_TYPE
to categorize keywords by their match type (exact, phrase, broad). - It retrieves the primary data source into a DataFrame
inputDf
and specifies key columns related to keywords, publishers, campaigns, and processing details.
- The script begins by defining a dictionary
- Data Extraction and Transformation
- The script extracts the ‘Pause Date’ and ‘Pause Reason’ from the
CCT_Single_keyword_processing
column by splitting the string data. - It converts the extracted ‘Pause Date’ strings into datetime objects for accurate date manipulation and comparison.
- The script extracts the ‘Pause Date’ and ‘Pause Reason’ from the
- Date Calculation and Filtering
- The script calculates a date five days prior to the current date to establish a threshold for filtering.
- It filters the DataFrame to include only those records where the ‘Pause Date’ is within the last five days, resulting in a filtered DataFrame
filtered_df
.
- Output Preparation
- The filtered DataFrame is assigned to
outputDf
, which contains the relevant paused CCT groups for further analysis or reporting.
- The filtered DataFrame is assigned to
Vitals
- Script ID : 631
- Client ID / Customer ID: 1306924347 / 69058
- Action Type: Email Report
- Item Changed: None
- Output Columns:
- Linked Datasource: M1 Report
- Reference Datasource: None
- Owner: Jonathan Reichl (jreichl@marinsoftware.com)
- Created by Jonathan Reichl on 2023-12-22 16:17
- Last Updated by Jonathan Reichl on 2023-12-22 16:17
> See it in Action
Python Code
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
##
## name: Alert paused cct single keywords
## description:
##
##
## author:
## created: 2023-12-19
##
MATCH_TYPE = {
'EXACT': 'exact',
'PHRASE': 'phrase',
'BROAD': 'broad',
}
today = datetime.datetime.now(CLIENT_TIMEZONE).date()
# primary data source and columns
inputDf = dataSourceDict["1"]
RPT_COL_KEYWORD = 'Keyword'
RPT_COL_PUBLISHER = 'Publisher'
RPT_COL_CCT_SOURCETEMPLATE = 'CCT_sourcetemplate'
RPT_COL_CAMPAIGN = 'Campaign'
RPT_COL_GROUP = 'Group'
RPT_COL_MATCH_TYPE = 'Match Type'
RPT_COL_STATUS = 'Status'
RPT_COL_KEYWORD_ID = 'Keyword ID'
RPT_COL_PUB_ID = 'Pub. ID'
RPT_COL_CCT_SINGLE_KEYWORD_PROCESSING = 'CCT_Single_keyword_processing'
# output columns and initial values
# user code start here
print(tableize(inputDf))
# Splitting and extracting date
inputDf['Pause Date'] = inputDf[RPT_COL_CCT_SINGLE_KEYWORD_PROCESSING].str.split('|').str[2]
inputDf['Pause Reason'] = inputDf[RPT_COL_CCT_SINGLE_KEYWORD_PROCESSING].str.split('|').str[1]
# Convert strings to datetime objects
inputDf['Pause Date'] = pd.to_datetime(inputDf['Pause Date'])
# Calculating 5 days ago from today
five_days_ago = today - datetime.timedelta(days=5)
# Convert the 'five_days_ago' date object to a pandas Timestamp for comparison
five_days_ago = pd.Timestamp(five_days_ago)
# Filtering the DataFrame
filtered_df = inputDf[inputDf['Pause Date'] >= five_days_ago]
outputDf = filtered_df
Post generated on 2025-03-11 01:25:51 GMT