Data Engineer signal: hash map + dedupe in a transaction dedupe context. This is a ProdMatch-owned data engineer drill, framed as a May 2026 CleverTap Trust Data simulation, not a copied platform question.
Company context
CleverTap · Trust Data
Freshness
May 2026
Product surface
transaction dedupe
ProdMatch interview simulation based on product-team patterns; not a claim of a real company question.
For transaction dedupe, collapse duplicate records by external ID. Keep the newest event per ID and return IDs in first-seen order.
Input
Output
Constraints
Concepts
[(a,1),(b,1),(a,3)] -> [(a,3),(b,1)]
Try framing your own approach first. The 30 seconds you think before peeking is where learning happens.
Reveal the approach first.
Your rating tunes when this problem shows up again.