|Table of Contents|

Citation:
 Gang Chen,Jie Cai,Niels Gorm Maly Rytter,et al.A Practical Data Quality Assessment Method for Raw Data in Vessel Operations[J].Journal of Marine Science and Application,2023,(2):370-380.[doi:10.1007/s11804-023-00326-w]
Click and Copy

A Practical Data Quality Assessment Method for Raw Data in Vessel Operations

Info

Title:
A Practical Data Quality Assessment Method for Raw Data in Vessel Operations
Author(s):
Gang Chen1 Jie Cai2 Niels Gorm Maly Rytter2 Marie Lützen3
Affilations:
Author(s):
Gang Chen1 Jie Cai2 Niels Gorm Maly Rytter2 Marie Lützen3
1 World Maritime University, Malmö, Sweden;
2 Department of Technology and Innovation, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Denmark;
3 Department of Mechanical and Electrical Engineering, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Denmark
Keywords:
Data qualityVessel operationsShippingValidation rulesNoon reports
分类号:
-
DOI:
10.1007/s11804-023-00326-w
Abstract:
With the current revolution in Shipping 4.0, a tremendous amount of data is accumulated during vessel operations. Data quality (DQ) is becoming more and more important for the further digitalization and effective decision-making in shipping industry. In this study, a practical DQ assessment method for raw data in vessel operations is proposed. In this method, specific data categories and data dimensions are developed based on engineering practice and existing literature. Concrete validation rules are then formed, which can be used to properly divide raw datasets. Afterwards, a scoring method is used for the assessment of the data quality. Three levels, namely good, warning and alarm, are adopted to reflect the final data quality. The root causes of bad data quality could be revealed once the internal dependency among rules has been built, which will facilitate the further improvement of DQ in practice. A case study based on the datasets from a Danish shipping company is conducted, where the DQ variation is monitored, assessed and compared. The results indicate that the proposed method is effective to help shipping industry improve the quality of raw data in practice. This innovation research can facilitate shipping industry to set a solid foundation at the early stage of their digitalization journeys.

References:

Ahn K, Rakha H, Hill D (2008) Data quality white paper.Technical Report.United States.Federal Highway Administration.Office of Operations
Alkhattabi M, Neagu D, Cullen A (2011) Assessing information quality of e-learning systems:a web mining approach.Computers in Human Behavior 27:862-873.https://doi.org/10.1016/j.chb.2010.11.011
Bates MJ (2019) Understanding information retrieval systems:management, types, and standards.Auerbach Publications
Blake R, Mangiameli P (2011) The effects and interactions of data quality and problem complexity on classification.Journal of Data and Information Quality (JDIQ) 2:1-28.https://doi.org/10.1145/1891879.1891881
Cai J, Chen G, L/tzen M, Rytter NGM (2021) A practical ais-based route library for voyage planning at the pre-fixture stage.Ocean Engineering 236:109478.https://doi.org/10.1016/j.oceaneng.2021.109478
Cai J, Jiang, X, Yang Y, Lodewijks G, Wang M (2022) Data-driven methods to predict the burst strength of corroded line pipelines subjected to internal pressure.Journal of Marine Science and Application 21:115-132.https://doi.org/10.1007/s11804-022-00263-0
Caro A, Calero C, Caballero I, Piattini M (2008) A proposal for a set of attributes relevant for web portal data quality.Software Quality Journal 16:513-542
Chengalur-Smith IN, Ballou DP, Pazer HL (1999) The impact of data quality information on decision making:an exploratory analysis.IEEE Transactions on Knowledge and Data Engineering 11:853-864.https://doi.org/10.1109/69.824597
Coen-Porisini A, Sicari S (2012) Improving data quality using a cross layer protocol in wireless sensor networks.Computer Networks 56:3655-3665.https://doi.org/10.1016/j.comnet.2012.08.001
De Mauro A, Greco M, Grimaldi M (2015) What is big data? A consensual definition and a review of key research topics, in:AIP Conference Proceedings, American Institute of Physics, 97-104.https://doi.org/10.1063/1.4907823
Dey D, Kumar S (2010) Reassessing data quality for information products.Management science 56:2316-2322.https://doi.org/10.1287/mnsc.1100.1261
Eisele WL, Rilett LR (2002) Travel-time estimates obtained from intelligent transportation systems and instrumented test vehicles:Statistical comparison.Transportation research record 1804:8-16.https://doi.org/10.3141/1804-02
Falge C, Otto B, Österle H (2012) Data quality requirements of collaborative business processes, in:2012 IEEE 45th Hawaii International Conference on System Sciences, 4316-4325.https://doi.org/10.1109/HICSS.2012.8
FORCE Technology (2021) Onboard decision support system.URL:https://forcetechnology.com/en/services/onboard-decision-supportsystem
Hazen BT, Boone CA, Ezell JD, Jones-Farmer LA (2014) Data quality for data science, predictive analytics, and big data in supply chain management:An introduction to the problem and suggestions for research and applications.International Journal of Production Economics 154:72-80.https://doi.org/10.1016/j.ijpe.2014.04.018
Hermann M, Pentek T, Otto B (2016) Design principles for industrie 4.0 scenarios, in:2016 49th Hawaii international conference on system sciences (HICSS), IEEE.pp.3928-3937.https://doi.org/10.1109/HICSS.2016.488
Jones-Farmer LA, Woodall WH, Steiner SH, Champ CW (2014) An overview of phase i analysis for process improvement and monitoring.Journal of Quality Technology 46:265-280.https://doi.org/10.1080/00224065.2014.11917969
Karagiannidis P, Themelis N (2021) Data-driven modelling of ship propulsion and the effect of data pre-processing on the prediction of ship fuel consumption and speed loss.Ocean Engineering 222, 108616.https://doi.org/10.1016/j.oceaneng.2021.108616
Knight Sa, Burn J (2005) Developing a framework for assessing information quality on the world wide web.Informing Science 8
KONGSBERG (2021) KONGSBERG Vessel Performance.URL:https://www.kongsberg.com/digital/kognifaiecosystem/kognifaimarketplace/maritime/vessel-performance/
Lee YW, Strong DM, Kahn BK, Wang RY (2002) Aimq:a methodology for information quality assessment.Information & management 40:133-146.https://doi.org/10.1016/S0378-7206(02) 00043-5
Liao CF, Davis GA (2012) Traffic data quality verification and sensor calibration for weigh-in-motion (wim) systems
Peltier JW, Zahay D, Lehmann DR (2013) Organizational learning and crm success:a model for linking organizational practices, customer data quality, and performance.Journal of interactive marketing 27:1-13.https://doi.org/10.1016/j.intmar.2012.05.001
Perera LP, Mo B (2020) Ship performance and navigation information under high-dimensional digital models.Journal of Marine Science and Technology 25:81-92
Pipino LL, Lee YW, Wang RY (2002) Data quality assessment.Communications of the ACM 45:211-218.https://doi.org/10.1145/505248.506010
Redman TC (1998) The impact of poor data quality on the typical enterprise.Communications of the ACM 41:79-82.https://doi.org/10.1145/269012.269025
Richardson JK, Smith BL (2012) Development of hypothesis test for travel time data quality.Transportation research record 2308:103-109.https://doi.org/10.3141/2308-11
Røseth ?J (2016) Integrating iec and iso information models into the s-100 common maritime data structure
Shankaranarayan G, Ziad M, Wang RY (2003) Managing data quality in dynamic decision environments:An information product approach.Journal of Database Management (JDM) 14:14-32.https://doi.org/10.4018/jdm.2003100102
Soner O, Akyuz E, Celik M (2018) Use of tree based methods in ship performance monitoring under operating conditions.Ocean Engineering 166:302-310.https://doi.org/10.1016/j.oceaneng.2018.07.061
Soner O, Akyuz E, Celik M (2019) Statistical modelling of ship operational performance monitoring problem.Journal of Marine Science and Technology 24:543-552.https://doi.org/10.1007/s00773-018-0574-y
Tejay G, Dhillon G, Chin AG (2004) Data quality dimensions for information systems security:A theoretical exposition, in:Working Conference on Integrity and Internal Control in Information Systems, Springer.pp.21-39
TORM (1889) TORM SHIPPING.URL:https://torm.com/
Turner S (2004) Defining and measuring traffic data quality:White paper on recommended approaches.Transportation research record 1870:62-69.https://doi.org/10.3141/1870-08
US Department of Transportation (2021) Bureau of Transportation Statistics.URL:http://ntl.bts.gov/lib/jpodocs/reptste/14058files/chap3.htm
VPS (2021) Vessel Performance Solutions.URL:https://www.vpsolutions.dk/
Wang RY, Strong DM (1996) Beyond accuracy:What data quality means to data consumers.Journal of management information systems 12:5-33.https://doi.org/10.1080/07421222.1996.11518099
Wang RY, Ziad M, Lee YW (2006) Data quality.volume 23.Springer Science & Business Media
Yan R, Wang S, Du Y (2020) Development of a two-stage ship fuel consumption prediction and reduction model for a dry bulk ship.Transportation Research Part E:Logistics and Transportation Review 138:101930.https://doi.org/10.1016/j.tre.2020.101930
Yerva SR, Miklós Z, Aberer K (2012) Quality-aware similarity assessment for entity matching in web data.Information Systems 37:336-351.https://doi.org/10.1016/j.is.2011.09.007

Memo

Memo:
Received date:2022-02-27;Accepted date:2022-09-16。
Corresponding author:Jie Cai,E-mail:jiec@iti.sdu.dk
Last Update: 2023-06-02