Introduction
As organizations increasingly rely on digital interactions to drive business outcomes, the management of high-frequency data, particularly web events, becomes critical. In Adobe Experience Platform (AEP), behavioral data can dominate the profile store, often constituting up to 90% of the profile population. Proper management of this data using expiration rules ensures compliance with license entitlements and maintains the relevance and usability of data for various use cases. This blog post delves into an effective strategy for managing behavioral data expiration in AEP, illustrated through a real-world implementation for a leading department store group.
Problem Statement or Background
In the digital age, businesses collect vast amounts of behavioral data from web events, which can quickly overwhelm data storage and processing capabilities if not properly managed. Different types of events have varying levels of relevance and value over time. For instance, purchase events may need to be retained for a year, while page views might only be relevant for 30 days. Implementing different expiration durations for various event types is essential to streamline data management and ensure that profiles remain within license limits.
Key Concepts or Terminology
- Behavioral Data: Information collected from user interactions on digital platforms, such as page views, product browsing, and purchases.
- Experience Event Schema: A data schema in AEP that includes mandatory fields like _id and Timestamp, representing the unique identifier and occurrence time of an event.
- Event Expiration: A process in AEP where data older than a specified duration is automatically deleted from the profile store.
- Real-Time Customer Profile (RTCP): A feature in AEP that allows for the unification and real-time analysis of customer data across various touchpoints.
- WebSDK: A library used to collect and send data from websites to AEP.
Detailed Explanation
Managing the expiration of high-frequency behavioral data involves setting up different retention durations for various event types. This ensures that only relevant data is retained, optimizing data storage and processing. Adobe Experience Platform supports this through the use of DataStreams and datasets, where each dataset can be configured with different expiration rules.
Automated Event Expiration Process
In AEP, an Experience Event Schema mandates the use of _id and Timestamp fields, which are populated during data ingestion. The Timestamp field should hold the actual occurrence time of the event, not the ingestion time. Event Type is another recommended field, providing context to the event and aiding downstream processes.
Experience Event expirations are configured to automatically remove data based on the Timestamp attribute. This feature must be enabled by contacting Adobe Support, as it is not accessible through the Platform UI or APIs. Once enabled, any data older than the specified expiration period is permanently deleted from the Unified Profile Store. This also includes the purging of anonymous profiles with no remaining valid events.
Step by Step Guide
Step 1: Configure Experience Event Schemas
Ensure that your Experience Event Schemas include the necessary fields (_id, Timestamp, Event Type) and are correctly set up for data ingestion.
Step 2: Create Multiple Datasets
Set up multiple datasets in AEP, each referencing the same Experience Event Schema but configured with different expiration durations. For instance:
- Dataset 1: Retention for purchase events (1 year)
- Dataset 2: Retention for page views and product browsing (30 days)
Step 3: Set Up DataStreams
Create separate DataStreams for each dataset. This allows you to route events to the appropriate dataset based on their type. DataStreams are configurations that define how data is sent to AEP and can be created in advance.
Step 4: Implement Custom Processing Rules
Use custom code in the data processing layer to dynamically route events to the correct DataStream. This can be based on conditions such as the event type or any other criteria available in the data layer. For example:
javascript
Copy code
var datastreamId;
if (pageType === ‘purchase’) {
datastreamId = ‘purchaseDataStreamId’;
} else {
datastreamId = ‘webEventsDataStreamId’;
}
sendToDataStream(datastreamId, eventData);
Step 5: Deploy Code via Tag Management
Deploy the custom processing code using a tag management solution like Adobe Launch or Google Tag Manager. Ensure the code runs on every relevant page to properly route events.
Best Practices or Tips
- Monitor Data Retention: Regularly review and adjust expiration durations based on evolving business needs and data usage patterns.
- Optimize DataStream Configuration: Keep the number of DataStreams and datasets within recommended limits to avoid system conflicts and ensure efficient processing.
- Use Event Type Consistently: Ensure that the Event Type field is consistently populated to facilitate accurate routing and reporting.
Case Studies or Examples
Real-World Example: Leading Department Store Group
A leading department store group in Europe implemented AEP RTCDP (Real-Time Customer Data Platform) to replace their Salesforce DMP and Google Analytics. They used WebSDK for data collection and Google Tag Manager for deployment. The challenge was to manage different retention requirements for various web events without increasing server calls.
The solution involved:
- Setting up two datasets with different retention periods.
- Creating two DataStreams for these datasets.
- Implementing custom processing rules to dynamically route events based on their type.
For example, purchase events were directed to a dataset with a 1-year retention period, while page views and product browsing events were directed to a dataset with a 30-day retention period. This ensured efficient data management and compliance with license limits.
Troubleshooting and FAQ
Q: How do I know if my expiration rules are working? A: Monitor the dataset usage and profile store to ensure that data is being purged as expected. Check the timestamps of remaining events to confirm that older data has been removed.
Q: Can I change expiration durations after they are set? A: Yes, but you will need to contact Adobe Support to adjust the expiration settings for your datasets.
Q: What happens if a profile has no remaining events? A: If a profile has no remaining events and no profile attributes, it will be purged from the Unified Profile Store.
Conclusion
Effective management of behavioral data using expiration rules in Adobe Experience Platform is crucial for maintaining data relevance, optimizing storage, and ensuring compliance with license entitlements. By implementing a multi-forked DataStream strategy, businesses can assign different retention periods to various event types, thereby enhancing data management efficiency. This approach not only supports better data governance but also aligns with business objectives by ensuring that valuable data remains available for analysis and decision-making.
By following the steps and best practices outlined in this blog post, organizations can streamline their data management processes, improve operational efficiency, and derive greater value from their digital interactions.
Leave a Reply