Data Engineer signal: sliding window + frequency map in a data lineage graph context. This is a ProdMatch-owned data engineer drill, framed as an April 2026 Zerodha Data Platform simulation, not a copied platform question.
Company context: Zerodha · Data Platform
Freshness: April 2026
Product surface: data lineage graph
ProdMatch interview simulation based on product-team patterns; not a claim of a real company question.
For the data lineage graph, process a timestamp-ordered stream of datasets, each carrying a risk label and a severity. Find the longest contiguous window in which at most k distinct risk labels appear and the total severity stays under budget.
Input
Output
Constraints
Concepts
Example: labels A,B,A,C with k=2 and budget=7 -> longest valid window length is 3 (the window A,B,A; extending it to C would bring in a third distinct label).
Try framing your own approach first. The 30 seconds you think before peeking is where learning happens.
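One possible shape for the sliding window + frequency map approach is sketched below. It is an illustration under stated assumptions, not a reference solution. The function name `longest_valid_window` and the parallel `severities` list are hypothetical. The sketch assumes severities are non-negative (so that shrinking from the left never makes a window worse) and reads "stays under budget" as total severity at most `budget`; switch the comparison to `>=` if the limit is meant to be strict.

```python
from collections import defaultdict


def longest_valid_window(labels, severities, k, budget):
    """Length of the longest contiguous window with at most k distinct
    labels and total severity <= budget (assumed reading of "under budget").

    Assumes severities are non-negative and aligned with labels by index.
    """
    counts = defaultdict(int)  # frequency map: label -> occurrences in window
    total = 0                  # running severity of the current window
    left = 0                   # left edge of the window (inclusive)
    best = 0

    for right, (label, sev) in enumerate(zip(labels, severities)):
        # Grow the window to include the item at `right`.
        counts[label] += 1
        total += sev

        # Shrink from the left until both constraints hold again.
        while left <= right and (len(counts) > k or total > budget):
            old = labels[left]
            counts[old] -= 1
            if counts[old] == 0:
                del counts[old]  # keep len(counts) equal to distinct labels
            total -= severities[left]
            left += 1

        best = max(best, right - left + 1)

    return best


# Severities are not given in the example above; these values are made up.
print(longest_valid_window(list("ABAC"), [2, 2, 2, 4], k=2, budget=7))
```

Each index enters and leaves the window at most once, so the sketch runs in O(n) time with O(k) extra space for the frequency map.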