(Preprint) Data Pricing in Machine Learning Pipelines
Zicun Cong ¹, Xuan Luo ¹, Pei Jian 裴健 ¹, Feida Zhu 朱飞达 ², Yong Zhang ³
¹ Simon Fraser University, Burnaby, Canada
² Singapore Management University, Singapore
³ Huawei Technologies Canada, Burnaby, Canada
arXiv, 2021-08-18
Abstract
Machine learning is disruptive. At the same time, machine learning can only succeed by collaboration among many parties in multiple steps naturally as pipelines in an eco-system, such as collecting data for possible machine learning applications, collaboratively training models by multiple parties and delivering machine learning services to end users. Data is critical and penetrating in the whole machine learning pipelines.
As machine learning pipelines involve many parties and, in order to be successful, have to form a constructive and dynamic eco-system, marketplaces and data pricing are fundamental in connecting and facilitating those many parties. In this article, we survey the principles and the latest research development of data pricing in machine learning pipelines. We start with a brief review of data marketplaces and pricing desiderata. Then, we focus on pricing in three important steps in machine learning pipelines.
To understand pricing in the step of training data collection, we review pricing raw data sets and data labels. We also investigate pricing in the step of collaborative training of machine learning models, and overview pricing machine learning models for end users in the step of machine learning deployment. We also discuss a series of possible future directions.
Ppt-level volatile organic compounds detection via microsecond-pulse-enhanced mid-infrared photoacoustic
Senyu Wang, Liang Zhao, Hongyu Luo, Xiangyu Zhao, Jianfeng Li, Wei Wang, Hao Lei, Mingrui Jiang, Jinlong Wan, Binxing Zhao, Bincheng Li, Yong Liu
Opto-Electronic Science
2026-04-23
AI-assisted metaphotonics
Minsung Kang, Seokju Choi, Kaixi Fu, Xiaoyuan Liu, Zhun Wei, Lei Jin, Hao Wang, Olivier J. F. Martin, Joel K. W. Yang, Sunae So, Trevon Badloe
Opto-Electronic Advances
2026-04-17
Terahertz imaging technology: progress and applications
Yuyuan Tian, Xiaoyin Chen, Zhuocheng Zhang, Qianze Yan, Yiming Liu, Chengliang Deng, Min Wan, Jiang Li, Xiaoqiuyan Zhang, Lu Rong, Elizaveta Tsiplakova, Nikolay Petrov, Xinke Wang, Liguo Zhu, Min Hu, Yan Zhang
Opto-Electronic Technology
2026-03-30
A 4096-element 3D-integrated Si-SiN optical phased array for high-power coherent LiDAR
Han Wang, Weimin Xie, Xin Yan, Jiaqi Li, Youxi Lu, Ping Jiang, Feng Li, Kai Jin, Xu Yang, Jiali Jiang, Keran Deng, Weishuai Chen, Jing Luo, Li Jin, Junbo Feng, Kai Wei
Opto-Electronic Technology
2026-03-20