Visualization-Oriented Progressive Time Series Transformation

08-24-2025ยท
Xin Chen
Xin Chen
,
Lingyu Zhang
,
Huaiwei Bao
,
Wei Lu
,
Eugene Wu
,
Xiaohui Yu
,
Yunhai Wang
ยท 3 min read
Overview of our system PIVOT for exploring compressed large-scale time series on a remote server.
Abstract
Visual analysis of large time-series data often requires transformations over multivariate time series. Existing methods struggle to meet interactive response time requirements, relying on full transformations that incur high computation costs. We propose a visualization-oriented transformation system PIVOT that incrementally generates accurate visualizations by selectively transforming only essential data samples. At its core is a transformation-aware query mechanism that efficiently computes point-wise transformations by leveraging cached hierarchical data on the server. To support responsive interaction, we introduce a pixel-based error-bound guarantee that estimates the accuracy of intermediate visualizations without requiring a reference, enabling a balance between latency and visual fidelity. Experiments show that PIVOT achieves highly accurate visualizationswith interactive responsetimes, outperforming existing error-free methods by up to an order of magnitude on billion-scale datasets.
Type
Publication
In ACM SIGMOD/PODS International Conference on Management of Data

Results

use_case
Visual analysis of multiple time series from a dataset of NYSE-listed stocks. Analysts may casually apply and compose various point-wise transformations to explore interesting patterns, expecting timely and highly accurate visualizations.

TAT_illustration
Illustration of M4 and our proposed TAT representation with a sample time series in (a). (b) The corresponding line charts with two pixel columns: the blue line connects all data points, while the black line uses only M4-aggregated samples in each pixel column. (c) The corresponding TAT structure, where each node stores the minimum and maximum values and the associated time interval.

func_class
Illustration of how our solution handles three types of transformation functions: (a) monotonic univariate, (b) non-monotonic univariate, and (c) bivariate. In all cases, the input time series has values {7, 8, 10, 6}, represented by black dots within a single pixel column. The corresponding function values are shown as black squares, while yellow and green squares mark the minimum and maximum function values within the domain, respectively. In (c), an additional time series {4, -3, 0, 1} is used, and the corresponding TAT structures are illustrated in (d).

query_mechanism
Illustration of the query mechanism Q on a time series of length 64 over a canvas with three pixel columns. (a) For the non-monotonic function f(x) = (x2 - 1) / 2, Q retrieves aggregated values from the TAT, with node scores shown below. (b) Evolution of candidate lists ฮฑ and ๐›ฝ over the query process, alongside corresponding progressive visualizations. The ground truth f(X), shown in gray, is for reference only; it is not computed during execution.

error_bound_estimation
Illustration of pixel errors under different combinations of Rk and gR. (a-d) The four extreme rasterization cases and (e) our average estimation approach using ฮฑ and ๐›ฝ values obtained in the first iteration. Red and blue boxes indicate erroneous pixels relative to the final visualization in Figure 5 (b) and consistently rasterized pixels across all four cases, respectively. Black dashed lines serve as virtual guides for rasterization between Rk that do not correspond to valid results.

individual_func_eval
Performance comparison of three systems using 13 representative transformation functions covering univariate, bivariate, and multivariate cases evaluated on all datasets: (a) SSIM, (b) response time, and (c) memory usage.

varying_m_n
Response time (a,c) and memory usage (b,d) of the three systems under varying (a,b) numbers of data points and (c,d) numbers of input variables when applying the transformation L2ln(X) to the synthesized time-series datasets.

varying_width
Results of evaluation metrics on the largest real-world dataset Power as the canvas width varies: (a) SSIM scores and (b) response time.

varying_tau
Performance under varying pixelerror rate thresholds ๐œ. (a) The boxplots summarize the SSIM scores across all datasets. (b) The dot plot shows the relationship between pixel error rate upper bound ฮต and the actual pixel rate ๐œ on the Power dataset. (c) The dot plot displays the mean SSIM scores for various configurations with a fixed ๐œ = 5%.

interactive_comparison
Interactive performance on the real-world dataset Taxi (a, b) and synthetic dataset Syn5B (c, d): (a, c) SSIM scores; (b, d) response times.

Supplementary Material

The supplemental material file is available at https://www.yunhaiwang.net/sigmod2026/PIVOT/supp.pdf.