Script 1363: Daily Budget Upload

Purpose:

The script processes and uploads daily budget data by transforming and deduplicating input records.

To Elaborate

The Python script is designed to handle the daily budget data upload process by transforming input data from a primary data source and preparing it for further use. It focuses on extracting specific columns from the input data, mapping them to a new structure, and ensuring that duplicate entries are removed based on a unique identifier. The script is part of a larger system that likely involves managing and allocating budgets for various campaigns, ensuring that each campaign’s budget is accurately recorded and updated. The deduplication step is crucial to prevent redundant data entries, which could lead to inaccuracies in budget tracking and reporting.

Walking Through the Code

  1. Data Source Initialization:
    • The script begins by defining the primary data source, inputDf, which contains the raw data needed for processing. This data includes columns such as ‘Group’, ‘Date’, ‘Account’, ‘Campaign’, ‘Daily Budget’, ‘Pub. ID’, and ‘Campaign Pub. ID’.
  2. Output DataFrame Setup:
    • A new DataFrame, outputDf, is created to store the transformed data. It maps specific columns from the input data to new column names, such as ‘Date’, ‘Group ID’, ‘Daily Budget Upload Conv’, and ‘Comments’.
  3. Data Transformation and Deduplication:
    • The script assigns values from the input DataFrame to the output DataFrame, ensuring that the necessary columns are correctly mapped.
    • It then removes duplicate entries from the outputDf based on the ‘Comments’ column, retaining only the first occurrence of each unique entry.
  4. User Code Execution:
    • The script concludes by printing a tabular representation of the first few rows of the input DataFrame, providing a quick overview of the data being processed.

Vitals

  • Script ID : 1363
  • Client ID / Customer ID: 1306927965 / 60270405
  • Action Type: Revenue Upload
  • Item Changed: None
  • Output Columns: Date, Group ID, Comments, Daily Budget Upload Conv
  • Linked Datasource: FTP/Email Feed
  • Reference Datasource: None
  • Owner: Tom McCaughey (tmccaughey@marinsoftware.com)
  • Created by Tom McCaughey on 2024-09-02 13:26
  • Last Updated by Tom McCaughey on 2024-09-03 15:20
> See it in Action

Python Code

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
##
## name: Daily Budget Upload
## description:
##  
## 
## author: 
## created: 2024-09-02
## 

today = datetime.datetime.now(CLIENT_TIMEZONE).date()

# primary data source and columns
inputDf = dataSourceDict["1"]
RPT_COL_GROUP = 'Group'
RPT_COL_DATE = 'Date'
RPT_COL_ACCOUNT = 'Account'
RPT_COL_CAMPAIGN = 'Campaign'
RPT_COL_DAILY_BUDGET = 'Daily Budget'
RPT_COL_PUB_ID = 'Pub. ID'
RPT_COL_CAMPAIGN_PUB_ID = 'Campaign Pub. ID'

# output columns and initial values
REVENUE_COL_DATE = 'Date'
REVENUE_COL_GROUP_ID = 'Group ID'
REVENUE_COL_DAILY_BUDGET_UPLOAD_CONV = 'Daily Budget Upload Conv'
REVENUE_COL_COMMENTS = 'Comments'
outputDf[REVENUE_COL_DATE] = inputDf['Date']
outputDf[REVENUE_COL_GROUP_ID] = inputDf['Pub. ID']
outputDf[REVENUE_COL_DAILY_BUDGET_UPLOAD_CONV] = inputDf['Daily Budget']
outputDf[REVENUE_COL_COMMENTS] = inputDf['Campaign Pub. ID']

outputDf = outputDf.drop_duplicates(subset=['Comments'], keep='first')



# user code start here
print(tableize(inputDf.head()))

Post generated on 2025-03-11 01:25:51 GMT

comments powered by Disqus