Data Pipeline metrics overview
Parse.ly’s Data Pipeline starts with a standard set of metrics and events that are worth instrumenting about many kinds of content and audience.
Event Types
Parse.ly’s tracking integration supports a number of standard event types out of the box, which help for measuring standard websites that include text and video content.
The type of an event is determined by the value of the action
field in each event. In a standard JavaScript integration, you will see mostly pageview
and heartbeat
events to start out. This table lists Parse.ly’s standard events and what they mean.
name | description |
---|---|
pageview | View of a URL; JavaScript tracker auto-fires this on load. |
heartbeat | Indicates page activity; auto-fires every few seconds. |
videostart | Video asset has started playing. |
vheartbeat | Video watching activity; auto-fires every few seconds. |
conversion | Conversion action has taken place. |
custom | Aside from the above reserved names, any name may be used for custom events. |
Metrics
A metric is nothing more than a count, unique count, or sum of a specific or filtered events.
The metrics below are each described in terms of a SQL query one might write to calculate the metric from a table containing all the raw data.
Page Views
SELECT COUNT(action) as views
FROM parsely.rawdata
WHERE action = 'pageview';
Unique Visitors
SELECT COUNT(DISTINCT visitor_site_id) as visitors
FROM parsely.rawdata;
Unique Sessions
SELECT COUNT(DISTINCT session) as sessions
FROM (
SELECT CONCAT(visitor_site_id, '_', session_id) as session
FROM parsely.rawdata
);
Engaged Time
SELECT SUM(engaged_time_inc) as time_in_seconds
FROM parsely.rawdata
WHERE action = 'heartbeat';
Conversions
SELECT COUNT(action) as conversions
FROM parsely.rawdata
WHERE action = 'conversion';
Video Starts
SELECT COUNT(action) as videostarts
FROM parsely.rawdata
WHERE action = 'videostart';
Unique Video Viewers
SELECT COUNT(DISTINCT parsely_visitor_id) as viewers
FROM parsely.rawdata
WHERE action = 'videostart';
Video Watch Time
SELECT SUM(engaged_time_inc) as time_in_seconds
FROM parsely.rawdata
WHERE action = 'vheartbeat';
Other Actions
SELECT action, COUNT(action) as num_actions
WHERE action != 'pageview'
FROM parsely.rawdata
GROUP BY 1
ORDER BY 2 DESC;
This query will count all the event types other than pageview
and sort by the ones with the highest frequency.
Channels
Channels, found in the channel
fields in the Data Pipeline, are places where user interactions happen. Parse.ly defined channels include:
fbia
(Facebook Instant Article)apln-rta
(Apple News Teal Time)amp
(Accelerated Mobile Pages)website
(Webpage traffic)native-ios
(iOS app traffic)native-android
(Android app traffic)
To filter specific channels, use the following queries:
Include only Apple news real-time data:
SELECT
<fields>
FROM parsely_rawdata
WHERE channel = 'apln-rta';
Exclude mobile app traffic:
SELECT
<fields>
FROM parsely_rawdata
WHERE channel not in ('native-ios','native-android');
Note, Apple News real time events include extra data from Apple News. See How to query Data Pipeline for Apple News Real-time data for more details on the extra data available!
Mobile Distributed Content
Parse.ly has support for Google Accelerated Mobile Pages (AMP) as described elsewhere in the integration docs. Raw events complying with the above event types come through as normal on these channels. AMP traffic is distinguished by a special query parameter whereas IA traffic is distinguished with a special referring URL.
Native Mobile Apps
Parse.ly has mobile SDKs for iOS and Android. At the present time, these SDKs only support the pageview
event, which are expected to be registered against “URLs” in the app that correspond to pages on your site with web parity. Support for custom events in our mobile SDKs is coming soon.
Off-Site Social Interactions
Parse.ly’s dashboard measures a number of off-site interactions, including organic social sharing on Facebook and Pinterest. These social interactions are not yet available in the Data Pipeline.
Traffic Investigation
When suspicious traffic is suspected, the following process will produce a report that allows insight into who is accessing your site or a specific page. The query outlined below returns an aggregation of pageviews by referrer, user_agent, and IP address. From our experience investigating suspicious traffic, this usually gives us a place to start investigating.
Traffic Investigation Query
SELECT
referrer,
user_agent,
visitor_ip,
count(action) as pageviews
FROM parsely.rawdata
WHERE
action = 'pageview'
and date(timestamp) = date('desired-date-here')
and url like '%url-pattern%'
GROUP BY 1,2,3
ORDER BY 4 desc;
Patterns to look for in the results:
- Bot traffic is often: direct traffic (pageviews with null referrers) and from similar user_agents and IP address patterns (ie. 123.45.987.*)
- Using reverse IP address lookups (such as https://mxtoolbox.com/ReverseLookup.aspx), it is also possible to identify other unknown traffic patterns.
Get Help
- If you are already a Parse.ly customer, get in touch with us, and we’ll be happy to walk through how events and metrics in the raw data map to Parse.ly’s real-time dashboard.
- If you are not a Parse.ly customer, we are glad to schedule a demo where we can share some of the advantages of our standard metrics and events in measuring user interactions, and how to sync that with custom events relevant to your business.
Last updated: September 30, 2024