40
IEEE Catalog Number: ISBN: CFP19BGD-POD 978-1-7281-0859-9 2019 IEEE International Conference on Big Data (Big Data 2019) Los Angeles, California, USA 9 – 12 December 2019 Pages 1-769 1/8

2019 IEEE International Conference on Big Data (Big Data

Embed Size (px)

Citation preview

IEEE Catalog Number: ISBN:

CFP19BGD-POD 978-1-7281-0859-9

2019 IEEE International Conference on Big Data (Big Data 2019)

Los Angeles, California, USA 9 – 12 December 2019

Pages 1-769

1/8

Copyright © 2019 by the Institute of Electrical and Electronics Engineers, Inc. All Rights Reserved Copyright and Reprint Permissions: Abstracting is permitted with credit to the source. Libraries are permitted to photocopy beyond the limit of U.S. copyright law for private use of patrons those articles in this volume that carry a code at the bottom of the first page, provided the per-copy fee indicated in the code is paid through Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923. For other copying, reprint or republication permission, write to IEEE Copyrights Manager, IEEE Service Center, 445 Hoes Lane, Piscataway, NJ 08854. All rights reserved. *** This is a print representation of what appears in the IEEE Digital Library. Some format issues inherent in the e-media version may also appear in this print version. IEEE Catalog Number: CFP19BGD-POD ISBN (Print-On-Demand): 978-1-7281-0859-9 ISBN (Online): 978-1-7281-0858-2 Additional Copies of This Publication Are Available From: Curran Associates, Inc 57 Morehouse Lane Red Hook, NY 12571 USA Phone: (845) 758-0400 Fax: (845) 758-2633 E-mail: [email protected] Web: www.proceedings.com

TABLE OF CONTENTS

RESPONSIBLE DATA SCIENCE ....................................................................................................................................... 1 Lise Getoor

DATACOMMONS ................................................................................................................................................................ 2 Ramanathan Guha

DECEPTION, ROBUSTNESS AND TRUST IN BIG DATA FUELED DEEP LEARNING SYSTEMS ...................... 3 Ling Liu

THE NEW SCIENCE OF CAUSE AND EFFECT, WITH REFLECTIONS ON DATA SCIENCE

AND ARTIFICIAL INTELLIGENCE ................................................................................................................................ 4 Judea Pearl

FEDERATED RECOMMENDATION SYSTEMS ............................................................................................................ 5 Yang Qiang

UNCERTAINTY-AWARE OPINION INFERENCE UNDER ADVERSARIAL ATTACKS ........................................ 6 Adil Alim ; Xujiang Zhao ; Jin-Hee Cho ; Feng Chen

MINDTHESTEP-ASYNCPSGD: ADAPTIVE ASYNCHRONOUS PARALLEL STOCHASTIC

GRADIENT DESCENT ...................................................................................................................................................... 16 Karl Bäckström ; Marina Papatriantafilou ; Philippas Tsigas

SKETCHYCORESVD: SKETCHYSVD FROM RANDOM SUBSAMPLING OF THE DATA

MATRIX ............................................................................................................................................................................... 26 Chandrajit Bajaj ; Yi Wang ; Tianming Wang

EXPLAINABLE AUTHORSHIP VERIFICATION IN SOCIAL MEDIA VIA ATTENTION-BASED

SIMILARITY LEARNING ................................................................................................................................................ 36 Benedikt Boenninghoff ; Steffen Hessler ; Dorothea Kolossa ; Robert M. Nickel

FINDING STABLE CLUSTERING FOR NOISY DATA VIA STRUCTURE-AWARE

REPRESENTATION .......................................................................................................................................................... 46 Huiyuan Chen ; Jing Li

RESTRICTED RECURRENT NEURAL NETWORKS ................................................................................................. 56 Enmao Diao ; Jie Ding ; Vahid Tarokh

PARSIMONIOUS MORPHEME SEGMENTATION WITH AN APPLICATION TO ENRICHING

WORD EMBEDDINGS ...................................................................................................................................................... 64 Ahmed El-Kishky ; Frank Xu ; Aston Zhang ; Jiawei Han

A STREAMING MODEL FOR GENERALIZED RAYLEIGH WITH EXTENSION TO MINIMUM

NOISE FRACTION ............................................................................................................................................................. 74 Soumyajit Gupta ; Chandrajit Bajaj

DETECTING MODEL CHANGES AND THEIR EARLY WARNING SIGNALS USING MDL

CHANGE STATISTICS ..................................................................................................................................................... 84 So Hirai ; Kenji Yamanishi

ON THE GLOBAL CONVERGENCE OF CONTINUOUS–TIME STOCHASTIC HEAVY–BALL

METHOD FOR NONCONVEX OPTIMIZATION ......................................................................................................... 94 Wenqing Hu ; Chris Junchi Li ; Xiang Zhou

MINING NEWS EVENTS FROM COMPARABLE NEWS CORPORA: A MULTI-ATTRIBUTE

PROXIMITY NETWORK MODELING APPROACH ................................................................................................. 105 Hyungsul Kim ; Ahmed El-Kishky ; Xiang Ren ; Jiawei Han

ON RELAXING FAILING QUERIES OVER RDF DATABASES .............................................................................. 115 Wafaa Mebrek ; Badran Raddaoui ; Mohamad Albilani

INFINITE DROPOUT FOR TRAINING BAYESIAN MODELS FROM DATA STREAMS ................................... 125 Van-Son Nguyen ; Duc-Tung Nguyen ; Linh Ngo Van ; Khoat Than

SUBSPACE CLUSTERING WITH ACTIVE LEARNING........................................................................................... 135 Hankui Peng ; Nicos G. Pavlidis

CTC-ATTENTION BASED NON-PARAMETRIC INFERENCE MODELING FOR CLINICAL

STATE PROGRESSION .................................................................................................................................................. 145 Riazat Ryan ; Handong Zhao ; Ming Shao

XSVM: SCALABLE DISTRIBUTED KERNEL SUPPORT VECTOR MACHINE TRAINING ............................. 155 Ruchi Shah ; Shaoshuai Zhang ; Ying Lin ; Panruo Wu

HDMF: HIERARCHICAL DATA MODELING FRAMEWORK FOR MODERN SCIENCE DATA

STANDARDS ..................................................................................................................................................................... 165 Andrew J. Tritt ; Oliver Rübel ; Benjamin Dichter ; Ryan Ly ; Donghe Kang ; Edward F. Chang ; Loren M. Frank

; Kristofer Bouchard

ORDALIA: DEEP LEARNING HYPERPARAMETER SEARCH VIA GENERALIZATION

ERROR BOUNDS EXTRAPOLATION ......................................................................................................................... 180 Benedetto J. Buratti ; Eli Upfal

HIERARCHICAL DEMAND FORECASTING FOR FACTORY PRODUCTION OF PERISHABLE

GOODS ............................................................................................................................................................................... 188 Can Chen ; Yijun Wang ; Guoan Huang ; Hui Xiong

SUBSAMPLED INFORMATION CRITERIA FOR BAYESIAN MODEL SELECTION IN THE BIG

DATA SETTING ............................................................................................................................................................... 194 Lijiang Geng ; Yishu Xue ; Guanyu Hu

AN INTERACTIVE DATA QUALITY TEST APPROACH FOR CONSTRAINT DISCOVERY AND

FAULT DETECTION ....................................................................................................................................................... 200 Hajar Homayouni ; Sudipto Ghosh ; Indrakshi Ray ; Michael G Kahn

ACTIVATION ENSEMBLES FOR DEEP NEURAL NETWORKS ........................................................................... 206 Diego Klabjan ; Mark Harmon

ONLINE FEDERATED MULTITASK LEARNING ..................................................................................................... 215 Rui Li ; Fenglong Ma ; Wenjun Jiang ; Jing Gao

SPARSE BLOCK REGRESSION (SBR) FOR BIG DATA WITH CATEGORICAL VARIABLES ........................ 221 Xiang Liu ; Huyunting Huang ; Ziyang Tang ; Tonglin Zhang ; Baijian Yang

ALGORITHMS ON COMPRESSED TIME-EVOLVING GRAPHS .......................................................................... 227 Michael Nelson ; Sridhar Radhakrishnan ; Chandra N. Sekharan

DISCOVERING PARTIAL PERIODIC SPATIAL PATTERNS IN SPATIOTEMPORAL

DATABASES ..................................................................................................................................................................... 233 R. Uday Kiran ; C. Saideep ; Koji Zettsu ; Masashi Toyoda ; Masaru Kitsuregawa ; P. Krishna Reddy

CO-REPRESENTATION LEARNING FRAMEWORK FOR THE OPEN-SET DATA

CLASSIFICATION ........................................................................................................................................................... 239 Zhuoyi Wang ; Bo Dong ; Yu Lin ; Yigong Wang ; Md Shihabul Islam ; Latifur Khan

MIXTURE-BASED MULTIPLE IMPUTATION MODEL FOR CLINICAL DATA WITH A

TEMPORAL DIMENSION .............................................................................................................................................. 245 Ye Xue ; Diego Klabjan ; Yuan Luo

A DISCRETE MATHEMATICS APPROACH FOR LARGE SCALE IMPROVEMENT IN

CLASSIFICATION TRAINING TIME .......................................................................................................................... 253 Sumedh Yadav ; Mathis Bode

DIAGNOSIS OF NEURAL NETWORK VIA BACKWARD DEDUCTION .............................................................. 260 Peifeng Yin ; Lei Huang ; Sunhwan Lee ; Mu Qiao ; Shubhi Asthana ; Tagiga Nakamura

WISER: A HIGHLY AVAILABLE HTAP DBMS FOR IOT APPLICATIONS ........................................................ 268 Ronald Barber ; Christian Garcia-Arellano ; Ronen Grosman ; Guy Lohman ; C. Mohan ; Rene Muller ; Hamid

Pirahesh ; Vijayshankar Raman ; Richard Sidle ; Adam Storm ; Yuanyuan Tian ; Pinar Tozun ; Yingjun Wu

PROGRESS-BASED CONTAINER SCHEDULING FOR SHORT-LIVED APPLICATIONS IN A

KUBERNETES CLUSTER .............................................................................................................................................. 278 Yuqi Fu ; Shaolun Zhang ; Jose Terrero ; Ying Mao ; Guangya Liu ; Sheng Li ; Dingwen Tao

FINDING MUTUAL X AT WECHAT-SCALE SOCIAL NETWORK IN TEN MINITUES .................................... 288 Conghui He ; Shijie Sun ; Benli Li ; Xiaogang Tu ; Donghai Yu

A BICAMERALISM VOTING FRAMEWORK FOR COMBINING KNOWLEDGE FROM

CLIENTS INTO BETTER PREDICTION ..................................................................................................................... 298 Yu-Tung Hsieh ; Chuan-Yu Lee ; Ching-Chi Lin ; Pangfeng Liu ; Jan-Jan Wu

RAPID PROTOTYPING OF HIERARCHICAL AGGLOMERATIVE CLUSTERING

ALGORITHMS FOR DISTRIBUTED SYSTEMS ........................................................................................................ 307 Saiyedul Islam ; Navneet Goyal ; Sundar Balasubramaniam ; Poonam Goyal ; Achal Agarwal ; Kirti Singh

Rathore ; Nischay Singh

GENERAL-PURPOSE VS. SPECIALIZED DATA ANALYTICS SYSTEMS: A GAME OF ML &

SQL THRONES ................................................................................................................................................................. 317 Evdokia Kassela ; Nikodimos Provatas ; Ioannis Konstantinou ; Avrilia Floratou ; Nectarios Koziris

D3N: A MULTI-LAYER CACHE FOR THE REST OF US ......................................................................................... 327 Emine Ugur Kaynar ; Mania Abdi ; Mohammad Hossein Hajkazemi ; Ata Turk ; Raja R. Sambasivan ; David

Cohen ; Larry Rudolph ; Peter Desnoyers ; Orran Krieger

AUPTIMIZER - AN EXTENSIBLE, OPEN-SOURCE FRAMEWORK FOR HYPERPARAMETER

TUNING ............................................................................................................................................................................. 339 Jiayi Liu ; Samarth Tripathi ; Unmesh Kurup ; Mohak Shah

MULTILYRA: SCALABLE DISTRIBUTED EVALUATION OF BATCHES OF ITERATIVE

GRAPH QUERIES ............................................................................................................................................................ 349 Abbas Mazloumi ; Xiaolin Jiang ; Rajiv Gupta

AFRAME: EXTENDING DATAFRAMES FOR LARGE-SCALE MODERN DATA ANALYSIS .......................... 359 Phanwadee Sinthong ; Michael J. Carey

PSPICE: PARTIAL MATCH SHEDDING FOR COMPLEX EVENT PROCESSING ............................................. 372 Ahmad Slo ; Sukanya Bhowmik ; Albert Flaig ; Kurt Rothermel

TAZER: HIDING THE COST OF REMOTE I/O IN DISTRIBUTED SCIENTIFIC WORKFLOWS .................... 383 Joshua Suetterlein ; Ryan D. Friese ; Nathan R. Tallent ; Malachi Schram

MECHANISM DESIGN FOR AN INCENTIVE-AWARE BLOCKCHAIN-ENABLED FEDERATED

LEARNING PLATFORM ................................................................................................................................................ 395 Kentaroh Toyoda ; Allan N. Zhang

NOVEL ONLINE ALGORITHMS FOR NONPARAMETRIC CORRELATIONS WITH

APPLICATION TO ANALYZE SENSOR DATA ......................................................................................................... 404 Wei Xiao

ELASTIC EXECUTOR PROVISIONING FOR ITERATIVE WORKLOADS ON APACHE SPARK ................... 413 Donglin Yang ; Wei Rang ; Dazhao Cheng ; Yu Wang ; Jiannan Tian ; Dingwen Tao

EIRENE: IMPROVING SHORT JOB LATENCY PERFORMANCE WITH COORDINATED

COLD DATA MIGRATION AND SCHEDULER-AWARE TASK CLONING ......................................................... 423 Wei Zhou ; K. Preston White ; Hongfeng Yu

THE OTREE: MULTIDIMENSIONAL INDEXING WITH EFFICIENT DATA SAMPLING FOR

HPC ..................................................................................................................................................................................... 433 Cesare Cugnasco ; Hadrien Calmet ; Pol Santamaria ; Raül Sirvent ; Ane Beatriz Eguzkitza ; Guillaume Houzeaux

; Yolanda Becerra ; Jordi Torres ; Jesus Labarta

AVES: A DECISION ENGINE FOR ENERGY-EFFICIENT STREAM ANALYTICS ACROSS

LOW-POWER DEVICES ................................................................................................................................................. 441 Roshan Bharath Das ; Marc X. Makkes ; Alexandru Uta ; Lin Wang ; Henri Bal

SCALING OUT MULTI-WAY STREAM JOINS USING OPTIMIZED, ITERATIVE PROBING ......................... 449 Manuel Dossinger ; Sebastian Michel

DATA SERVICES WITH BINDAAS: RESTFUL INTERFACES FOR DIVERSE DATA SOURCES .................... 457 Pradeeban Kathiravelu ; Yusuf Nadir Saghar ; Tushar Aggarwal ; Ashish Sharma

DATACALC: AD-HOC ANALYSES ON HETEROGENEOUS DATA SOURCES .................................................. 463 Johannes Luong ; Dirk Habich ; Wolfgang Lehner

INVESTIGATING EDGE VS. CLOUD COMPUTING TRADE-OFFS FOR STREAM

PROCESSING ................................................................................................................................................................... 469 Pedro Silva ; Alexandru Costan ; Gabriel Antoniu

EFFECTIVELY UNIFIED OPTIMIZATION FOR LARGE-SCALE GRAPH COMMUNITY

DETECTION ..................................................................................................................................................................... 475 Jianping Zeng ; Hongfeng Yu

SUBGRAPH2VEC: HIGHLY-VECTORIZED TREE-LIKE SUBGRAPH COUNTING .......................................... 483 Langshi Chen ; Jiayu Li ; Cenk Sahinalp ; Madhav Marathe ; Anil Vullikanti ; Andrey Nikolaev ; Egor Smirnov ;

Ruslan Israfilov ; Judy Qiu

NIOBE: AN INTELLIGENT I/O BRIDGING ENGINE FOR COMPLEX AND DISTRIBUTED

WORKFLOWS .................................................................................................................................................................. 493 Kun Feng ; Hariharan Devarajan ; Anthony Kougkas ; Xian-He Sun

NETDYNA: MINING NETWORKED COEVOLVING TIME SERIES WITH MISSING VALUES ...................... 503 Hairi ; Hanghang Tong ; Lei Ying

G-FINDER: APPROXIMATE ATTRIBUTED SUBGRAPH MATCHING ................................................................ 513 Lihui Liu ; Boxin Du ; Jiejun Xu ; Hanghang Tong

EXPERIMENTAL EVALUATION OF BOUNDED-DEPTH LSM MERGE POLICIES .......................................... 523 Qizhong Mao ; Steven Jacobs ; Waleed Amjad ; Vagelis Hristidis ; Vassilis J. Tsotras ; Neal E. Young

GENERALIZING DESIGN OF SUPPORT MEASURES FOR COUNTING FREQUENT

PATTERNS IN GRAPHS ................................................................................................................................................. 533 Jinghan Meng ; Napath Pitaksirianan ; Yicheng Tu

MULTIVARIATE LONG-TERM STATE FORECASTING IN CYBER-PHYSICAL SYSTEMS: A

SEQUENCE TO SEQUENCE APPROACH................................................................................................................... 543 Nikhil Muralidhar ; Sathappan Muthiah ; Kiyoshi Nakayama ; Ratnesh Sharma ; Naren Ramakrishnan

SEMI-SUPERVISED LEARNING OVER STREAMING DATA USING MOA ........................................................ 553 Minh Huong Le Nguyen ; Heitor Murilo Gomes ; Albert Bifet

UNDERSTANDING SPATIO-TEMPORAL URBAN PROCESSES ........................................................................... 563 Lais M. A. Rocha ; Aline Bessa ; Fernando Chirigati ; Eugene Ofriel ; Mirella M. Moro ; Juliana Freire

VULNERCHECK: A CONTENT-AGNOSTIC DETECTOR FOR ONLINE HATRED-

VULNERABLE VIDEOS ................................................................................................................................................. 573 Lanyu Shang ; Daniel Yue Zhang ; Michael Wang ; Dong Wang

CS*: APPROXIMATE QUERY PROCESSING ON BIG DATA USING SCALABLE JOIN

CORRELATED SAMPLE SYNOPSIS ........................................................................................................................... 583 Feng Yu ; Wen-Chi Hou

INCREMENTAL AND ADAPTIVE FEATURE EXPLORATION OVER TIME SERIES STREAM ..................... 593 Jingwei Zuo ; Karine Zeitouni ; Yehia Taher

CLUSTER-BASED COMPUTATION OF EXACT BETWEENNESS CENTRALITY IN LARGE

UNDIRECTED GRAPHS ................................................................................................................................................. 603 Cecile Daniel ; Angelo Furno ; Eugenio Zimeo

ENGINEERING A PARALLEL Δ-STEPPING ALGORITHM ................................................................................... 609 Erika Duriakova ; Deepak Ajwani ; Neil Hurley

APPROXIMATE QUERYING FOR THE PROPERTY GRAPH LANGUAGE CYPHER ...................................... 617 George Fletcher ; Alexandra Poulovassilis ; Petra Selmer ; Peter T. Wood

FAST RECORD LINKAGE FOR COMPANY ENTITIES ........................................................................................... 623 Thomas Gschwind ; Christoph Miksovic ; Julian Minder ; Katsiaryna Mirylenka ; Paolo Scotton

GRADI: TOWARDS GROUP RECOMMENDATION USING ATTENTIVE DUAL TOP-DOWN

AND BOTTOM-UP INFLUENCES ................................................................................................................................. 631 Zhixiang He ; Chi-Yin Chow ; Jia-Dong Zhang ; Ning Li

A SCALABLE ALGORITHM FOR MULTI-CLASS SUPPORT VECTOR MACHINE ON GEO-

DISTRIBUTED DATASETS ............................................................................................................................................ 637 Tasnim Kabir ; Muhammad Abdullah Adnan

ANALYSIS OF EVOLUTIONARY BEHAVIOR IN SELF-LEARNING MEDIA SEARCH

ENGINES ........................................................................................................................................................................... 643 Nikki Lijing Kuang ; Leung Clement H. C.

ENHANCED ALTERNATE ACTION RECOMMENDER SYSTEM USING RECURRENT

PATTERNS AND FAULT DETECTION SYSTEM FOR SMART HOME USERS .................................................. 651 Prabhat Mishra ; Suresh Kumar Gudla ; Amogha D Shanbhag ; Joy Bose

LEARNING DATA TRANSFORMATIONS WITH MINIMAL USER EFFORT ..................................................... 657 Minh Pham ; Craig A. Knoblock ; Jay Pujara

SENSE: SEMANTICALLY ENHANCED NODE SEQUENCE EMBEDDING ......................................................... 665 Swati Rallapalli ; Liang Ma ; Mudhakar Srivatsa ; Ananthram Swami ; Heesung Kwon ; Graham Bent ;

Christopher Simpkin

AGGREGATE QUERY PREDICTION UNDER DYNAMIC WORKLOADS ........................................................... 671 Fotis Savva ; Christos Anagnostopoulos ; Peter Triantafillou

IPROWA: A NOVEL PROBABILISTIC GRAPHICAL MODEL FOR CROWDSOURCING

AGGREGATION............................................................................................................................................................... 677 Tianqi Wang ; Houping Xiao ; Fenglong Ma ; Jing Gao

ONLINE FEATURE SELECTION WITH CAPRICIOUS STREAMING FEATURES: A GENERAL

FRAMEWORK .................................................................................................................................................................. 683 Di Wu ; Yi He ; Xin Luo ; Mingsheng Shang ; Xindong Wu

PERFORMANCE OPTIMIZATION OF SPMV ON SPARK ...................................................................................... 689 Kun Xie ; Che-Rung Lee ; Feng-Yuan Liu

DEEP MULTI-HEAD ATTENTION NETWORK FOR ASPECT-BASED SENTIMENT ANALYSIS ................... 695 Danfeng Yan ; Jiyuan Chen ; Jianfei Cui ; Ao Shan ; Wenting Shi

INDEX BASED EFFICIENT ALGORITHMS FOR CLOSEST COMMUNITY SEARCH ...................................... 701 Esra Akbas

LOCAL SEARCH FOR GROUP CLOSENESS MAXIMIZATION ON BIG GRAPHS ........................................... 711 Eugenio Angriman ; Alexander Van Der Grinten ; Henning Meyerhenke

COLLABORATIVE RANKING TAGS AND ITEMS VIA CROSS-DOMAIN RECOMMENDATION ................. 721 Huiyuan Chen ; Jing Li

COLLECTIVE TENSOR COMPLETION WITH MULTIPLE HETEROGENEOUS SIDE

INFORMATION ................................................................................................................................................................ 731 Huiyuan Chen ; Jing Li

ADAPTED TREE BOOSTING FOR TRANSFER LEARNING .................................................................................. 741 Wenjing Fang ; Chaochao Chen ; Bowen Song ; Li Wang ; Jun Zhou ; Kenny Q. Zhu

SIM: OPEN-WORLD MULTI-TASK STREAM CLASSIFIER WITH INTEGRAL SIMILARITY

METRICS ........................................................................................................................................................................... 751 Yang Gao ; Yi-Fan Li ; Bo Dong ; Yu Lin ; Latifur Khan

FEATURE SCORING USING TREE-BASED ENSEMBLES FOR EVOLVING DATA STREAMS ...................... 761 Heitor Murilo Gomes ; Rodrigo Fernandes De Mello ; Bernhard Pfahringer ; Albert Bifet

TOWARDS INTERPRETABLE GRAPH MODELING WITH VERTEX REPLACEMENT

GRAMMARS ..................................................................................................................................................................... 770 Justus Hibshman ; Satyaki Sikdar ; Tim Weninger

CLUSTERED HIERARCHICAL ENTROPY-SCALING SEARCH OF ASTRONOMICAL AND

BIOLOGICAL DATA ....................................................................................................................................................... 780 Najib Ishaq ; George Student ; Noah M. Daniels

MULTI-STEP FORECASTING VIA MULTI-TASK LEARNING .............................................................................. 790 Shayan Jawed ; Ahmed Rashed ; Lars Schmidt-Thieme

DATA CONTEXT ADAPTATION FOR ACCURATE RECOMMENDATION WITH ADDITIONAL

INFORMATION ................................................................................................................................................................ 800 Hyunsik Jeon ; Bonhun Koo ; U Kang

B-BIT SKETCH TRIE: SCALABLE SIMILARITY SEARCH ON INTEGER SKETCHES ................................... 810 Shunsuke Kanda ; Yasuo Tabei

MODELLING ONLINE COMMENT THREADS FROM THEIR START ................................................................ 820 Rachel Krohn ; Tim Weninger

IMPROVING SCALABILITY OF PARALLEL CNN TRAINING BY ADJUSTING MINI-BATCH

SIZE AT RUN-TIME ........................................................................................................................................................ 830 Sunwoo Lee ; Qiao Kang ; Sandeep Madireddy ; Prasanna Balaprakash ; Ankit Agrawal ; Alok Choudhary ;

Richard Archibald ; Wei-Keng Liao

MULTI-LAYER FACIAL REPRESENTATION LEARNING FOR EARLY PREDICTION OF

SEPTIC SHOCK ............................................................................................................................................................... 840 Chen Lin ; Julie Ivy ; Min Chi

JSCN: JOINT SPECTRAL CONVOLUTIONAL NETWORK FOR CROSS DOMAIN

RECOMMENDATION ..................................................................................................................................................... 850 Zhiwei Liu ; Lei Zheng ; Jiawei Zhang ; Jiayu Han ; Philip S. Yu

PREDICTING INFLUENCE PROBABILITIES USING GRAPH CONVOLUTIONAL NETWORKS .................. 860 Jing Liu ; Yudi Chen ; Duanshun Li ; Noseong Park ; Kisung Lee ; Dongwon Lee

HYBRID DEEP EMBEDDING FOR RECOMMENDATIONS WITH DYNAMIC ASPECT-LEVEL

EXPLANATIONS .............................................................................................................................................................. 870 Huanrui Luo ; Ning Yang ; Philip S. Yu

FAST COMPUTATION OF PERSISTENT HOMOLOGY WITH DATA REDUCTION AND DATA

PARTITIONING ............................................................................................................................................................... 880 Nicholas O. Malott ; Philip A. Wilsey

HIGH DIMENSIONAL DATA CLUSTERING BY MEANS OF DISTRIBUTED DIRICHLET

PROCESS MIXTURE MODELS ..................................................................................................................................... 890 Khadidja Meguelati ; Benedicte Fontez ; Nadine Hilgert ; Florent Masseglia

SENTIMENT-AWARE AND PERSONALIZED TOUR RECOMMENDATION ...................................................... 900 Prarthana Padia ; Kwan Hui Lim ; Jeffrey Cha ; Aaron Harwood

UNSUPERVISED AUTHOR DISAMBIGUATION USING HETEROGENEOUS GRAPH

CONVOLUTIONAL NETWORK EMBEDDING ......................................................................................................... 910 Ziyue Qiao ; Yi Du ; Yanjie Fu ; Pengfei Wang ; Yuanchun Zhou

NEWS RECOMMENDER SYSTEM CONSIDERING TEMPORAL DYNAMICS AND NEWS

TAXONOMY ..................................................................................................................................................................... 920 Shaina Raza ; Chen Ding

PATIENT-LEVEL CLASSIFICATION ON CLINICAL NOTE SEQUENCES GUIDED BY

ATTRIBUTED HIERARCHICAL ATTENTION .......................................................................................................... 930 Cansu Sen ; Thomas Hartvigsen ; Xiangnan Kong ; Elke Rundensteiner

RECANT: NETWORK-BASED RECRUITMENT FOR ACTIVE FAKE NEWS CORRECTION ......................... 940 Ajitesh Srivastava ; Rajgopal Kannan ; Charalampos Chelmis ; Viktor K. Prasanna

SCALABLE DISTRIBUTED SUBTRAJECTORY CLUSTERING ............................................................................. 950 Panagiotis Tampakis ; Nikos Pelekis ; Christos Doulkeridis ; Yannis Theodoridis

EFFECTIVE UTILIZATION OF EXTERNAL KNOWLEDGE AND HISTORY CONTEXT IN

MULTI-TURN SPOKEN LANGUAGE UNDERSTANDING MODEL ...................................................................... 960 Yufan Wang ; Tingting He ; Rui Fan ; Wenji Zhou ; Xinhui Tu

DEEPBLUE: BI-LAYERED LSTM FOR TWEET POPULARITY ESTIMATION .................................................. 968 Jian Wen ; Zhongbao Zhang ; Zichang Yin ; Li Sun ; Sen Su ; Philip S. Yu

NON-LOCAL ATTENTION LEARNING ON LARGE HETEROGENEOUS INFORMATION

NETWORKS ...................................................................................................................................................................... 978 Yuxin Xiao ; Zecheng Zhang ; Carl Yang ; Chengxiang Zhai

DETECT: DEEP TRAJECTORY CLUSTERING FOR MOBILITY-BEHAVIOR ANALYSIS .............................. 988 Mingxuan Yue ; Yaguang Li ; Haoze Yang ; Ritesh Ahuja ; Yao-Yi Chiang ; Cyrus Shahabi

ORIGIN: NON-RIGID NETWORK ALIGNMENT ...................................................................................................... 998 Si Zhang ; Hanghang Tong ; Jiejun Xu ; Yifan Hu ; Ross Maciejewski

TOWARDS REAL TIME TEAM OPTIMIZATION ................................................................................................... 1008 Qinghai Zhou ; Liangyue Li ; Hanghang Tong

INTEGRATING LOCAL VERTEX/EDGE EMBEDDING VIA DEEP MATRIX FUSION AND

SIAMESE MULTI-LABEL CLASSIFICATION ......................................................................................................... 1018 Yang Zhou ; Chao Jiang ; Zijie Zhang ; Dejing Dou ; Ruoming Jin ; Pengwei Wang

BIGMAT: A DISTRIBUTED AFFINITY-PRESERVING RANDOM WALK STRATEGY FOR

INSTANCE MATCHING ON KNOWLEDGE GRAPHS ........................................................................................... 1028 Ali Assi ; Hamid Mcheick ; Wajdi Dhifli

MULTIMODAL, CONTEXT-AWARE, FEATURE REPRESENTATION LEARNING FOR

CLASSIFICATION AND LOCALIZATION ............................................................................................................... 1034 Sreyasee Das Bhattacharjee ; William J. Tolone ; Roy Cheria ; Urmimala Sarka

ARBITRATED DYNAMIC ENSEMBLE WITH ABSTAINING FOR TIME-SERIES

FORECASTING ON DATA STREAMS ....................................................................................................................... 1040 Dihia Boulegane ; Albert Bifet ; Giyyarpuram Madhusudan

DEEP TENSOR FACTORIZATION FOR MULTI-CRITERIA RECOMMENDER SYSTEMS ........................... 1046 Zhengyu Chen ; Sibo Gai ; Donglin Wang

MOTIF2VEC: MOTIF AWARE NODE REPRESENTATION LEARNING FOR

HETEROGENEOUS NETWORKS ............................................................................................................................... 1052 Manoj Reddy Dareddy ; Mahashweta Das ; Hao Yang

GRAPH MATCHING ON SOCIAL NETWORKS WITHOUT ANY SIDE INFORMATION ............................... 1060 Charalampos Davalas ; Dimitrios Michail ; Iraklis Varlamis

PYRAMID: A GENERAL FRAMEWORK FOR DISTRIBUTED SIMILARITY SEARCH ON

LARGE-SCALE DATASETS ......................................................................................................................................... 1066 Shiyuan Deng ; Xiao Yan ; K. W. Ng Kelvin ; Chenyu Jiang ; James Cheng

VISUALIZATION FOR QUALITY HEALTHCARE: PATIENT FLOW EXPLORATION .................................. 1072 Veronika Domova ; Shiva Sander-Tavallaey

REFINING IMAGE SEARCH RESULTS USING MULTIPLE ATTRIBUTES ...................................................... 1080 Tingting Dong ; Shoji Nishimura ; Jianquan Liu

THE LINK REGRESSION PROBLEM IN GRAPH STREAMS ............................................................................... 1088 Bowen Dong ; Charu C. Aggarwal ; Philip S. Yu

HIERARCHICAL-DOCUMENT-STRUCTURE-AWARE ATTENTION WITH ADAPTIVE COST

SENSITIVE LEARNING FOR BIOMEDICAL DOCUMENT CLASSIFICATION ................................................ 1096 Dandan Fang ; Jinyong Zhang ; Weizhong Zhao ; Xiaowei Xu ; Xingpeng Jiang ; Xiaohua Hu ; Tingting He

METAPATH ENHANCED GRAPH ATTENTION ENCODER FOR HINS REPRESENTATION

LEARNING ...................................................................................................................................................................... 1103 Yuwei Fu ; Yun Xiong ; Philip S. Yu ; Tianyi Tao ; Yangyong Zhu

TASK FAILURE PREDICTION IN CLOUD DATA CENTERS USING DEEP LEARNING ................................ 1111 Jiechao Gao ; Haoyu Wang ; Haiying Shen

GOSSIP LEARNING: OFF THE BEATEN PATH ...................................................................................................... 1117 Lodovico Giaretta ; Šarunas Girdzijauskas

VISUAL ANOMALY DETECTION IN EVENT SEQUENCE DATA ....................................................................... 1125 Shunan Guo ; Zhuochen Jin ; Qing Chen ; David Gotz ; Hongyuan Zha ; Nan Cao

MULTI-SPATIAL SCALE EVENT DETECTION FROM GEO-TAGGED TWEET STREAMS VIA

POWER-LAW VERIFICATION ................................................................................................................................... 1131 Yi Han ; Shanika Karunasekera ; Christopher Leckie ; Aaron Harwood

FROM TEXT CLASSIFICATION TO KEYPHRASE EXTRACTION FOR SHORT TEXT ................................ 1137 Song-Eun Lee ; Kang-Min Kim ; Woo-Jong Ryu ; Jemin Park ; Sangkeun Lee

ENHANCING ITEMSET TREE RULES AND PERFORMANCE ............................................................................ 1143 Jay Lewis ; Ryan G. Benton ; David Bourrie ; Jennifer Lavergne

MC2:UNSUPERVISED MULTIPLE SOCIAL NETWORK ALIGNMENT ............................................................. 1151 Gen Li ; Li Sun ; Zhongbao Zhang ; Pengxin Ji ; Sen Su ; Philip S. Yu

GRAPH MATCHING VIA MULTI-SCALE HEAT DIFFUSION ............................................................................. 1157 Lin Li ; Daniel L. Sussman

COMMUNITY-PRESERVING GRAPH CONVOLUTIONS FOR STRUCTURAL AND

FUNCTIONAL JOINT EMBEDDING OF BRAIN NETWORKS ............................................................................. 1163 Jiahao Liu ; Guixiang Ma ; Fei Jiang ; Chun-Ta Lu ; Philip S. Yu ; Ann B. Ragin

LATTE: APPLICATION ORIENTED SOCIAL NETWORK EMBEDDING .......................................................... 1169 Lin Meng ; Jiyang Bai ; Jiawei Zhang

D2D-TM: A CYCLE VAE-GAN FOR MULTI-DOMAIN COLLABORATIVE FILTERING ............................... 1175 Linh Nguyen ; Tsukasa Ishigaki

TWO PROBLEMS IN KNOWLEDGE GRAPH EMBEDDING: NON-EXCLUSIVE RELATION

CATEGORIES AND ZERO GRADIENTS................................................................................................................... 1181 Nasheen Nur ; Noseong Park ; Kookjin Lee ; Hyunjoong Kang ; Soonhyeon Kwon

ADAPTIVE DEEP LEARNING BASED TIME-VARYING VOLUME COMPRESSION ...................................... 1187 Yu Pan ; Feiyu Zhu ; Tian Gao ; Hongfeng Yu

WEAKLY SUPERVISED DEEP LEARNING APPROACH IN STREAMING ENVIRONMENTS...................... 1195 Mahardhika Pratama ; Andri Ashfahani ; Abdul Hady

MAXIMIZING CONTRASTING OPINIONS IN SIGNED SOCIAL NETWORKS ................................................ 1203 Kaivalya Rawal ; Arijit Khan

USTAR: ONLINE MULTIMODAL EMBEDDING FOR MODELING USER-GUIDED

SPATIOTEMPORAL ACTIVITY ................................................................................................................................. 1211 Amila Silva ; Shanika Karunasekera ; Christopher Leckie ; Ling Luo

FAST ANOMALY DETECTION IN MULTIPLE MULTI-DIMENSIONAL DATA STREAMS .......................... 1218 Hongyu Sun ; Qiang He ; Kewen Liao ; Timos Sellis ; Longkun Guo ; Xuyun Zhang ; Jun Shen ; Feifei Chen

DNA: DYNAMIC SOCIAL NETWORK ALIGNMENT ............................................................................................. 1224 Li Sun ; Zhongbao Zhang ; Pengxin Ji ; Jian Wen ; Sen Su ; Philip S. Yu

STUDY GROUP TRAVEL BEHAVIOUR PATTERNS FROM LARGE-SCALE SMART CARD

DATA ................................................................................................................................................................................ 1232 Xiancai Tian ; Baihua Zheng

IMPROVED TABLE RETRIEVAL USING MULTIPLE CONTEXT EMBEDDINGS FOR

ATTRIBUTES .................................................................................................................................................................. 1238 Mohamed Trabelsi ; Brian D. Davison ; Jeff Heflin

META-PATH REDUCTION WITH TRANSITION PROBABILITY PRESERVING IN

HETEROGENEOUS INFORMATION NETWORK .................................................................................................. 1245 Xiaokai Wei ; Zhiwei Liu ; Lichao Sun ; Philip S. Yu

PRACTICAL END-TO-END REPOSITIONING ALGORITHM FOR MANAGING BIKE-

SHARING SYSTEM ....................................................................................................................................................... 1251 Akihiro Yoshida ; Yosuke Yatsushiro ; Nozomi Hata ; Tatsuru Higurashi ; Nariaki Tateiwa ; Takashi Wakamatsu ;

Akira Tanaka ; Kenichi Nagamatsu ; Katsuki Fujisawa

DEEP DIFFUSIVE NEURAL NETWORK BASED FAKE NEWS DETECTION FROM

HETEROGENEOUS SOCIAL NETWORKS .............................................................................................................. 1259 Jiawei Zhang ; Bowen Dong ; Philip S. Yu

SEMI-SUPERVISED CLASSIFICATION-BASED LOCAL VERTEX RANKING VIA DUAL

GENERATIVE ADVERSARIAL NETS ....................................................................................................................... 1267 Yang Zhou ; Jiaxiang Ren ; Sixing Wu ; Dejing Dou ; Ruoming Jin ; Zijie Zhang ; Pengwei Wang

INTERACTIVE VISUALIZATION OF TIME-VARYING HYPERSPECTRAL PLANT IMAGES

FOR HIGH-THROUGHPUT PHENOTYPING ........................................................................................................... 1274 Feiyu Zhu ; Yu Pan ; Tian Gao ; Harkamal Walia ; Hongfeng Yu

DENOISING AND VERIFICATION CROSS-LAYER ENSEMBLE AGAINST BLACK-BOX

ADVERSARIAL ATTACKS .......................................................................................................................................... 1282 Ka-Ho Chow ; Wenqi Wei ; Yanzhao Wu ; Ling Liu

RAPID: RAPID AND PRECISE INTERPRETABLE DECISION SETS .................................................................. 1292 Sunny Dhamnani ; Dhruv Singal ; Ritwik Sinha ; Tharun Mohandoss ; Manish Dash

DIFFERENTIALLY PRIVATE ROBUST ADMM FOR DISTRIBUTED MACHINE LEARNING ..................... 1302 Jiahao Ding ; Xinyue Zhang ; Mingsong Chen ; Kaiping Xue ; Chi Zhang ; Miao Pan

SECUREGBM: SECURE MULTI-PARTY GRADIENT BOOSTING ...................................................................... 1312 Zhi Feng ; Haoyi Xiong ; Chuanyuan Song ; Sijia Yang ; Baoxin Zhao ; Licheng Wang ; Zeyu Chen ; Shengwen

Yang ; Liping Liu ; Jun Huan

WOULD A FILE BY ANY OTHER NAME SEEM AS MALICIOUS? ..................................................................... 1322 Andre T. Nguyen ; Edward Raff ; Aaron Sant-Miller

OPAL: HIGH PERFORMANCE PLATFORM FOR LARGE-SCALE PRIVACY-PRESERVING

LOCATION DATA ANALYTICS ................................................................................................................................. 1332 Axel Oehmichen ; Shubham Jain ; Andrea Gadotti ; Yves-Alexandre De Montjoye

INSIDER THREAT DETECTION VIA HIERARCHICAL NEURAL TEMPORAL POINT

PROCESSES .................................................................................................................................................................... 1343 Shuhan Yuan ; Panpan Zheng ; Xintao Wu ; Qinghua Li

TOWARD IMAGE PRIVACY CLASSIFICATION AND SPATIAL ATTRIBUTION OF PRIVATE

CONTENT ....................................................................................................................................................................... 1351 Haoti Zhong ; Hao Li ; Anna Squicciarini ; Sarah Rajtmajer ; David Miller

DEEPMANNER: AUTOMATICALLY DETERMINING MANNER OF DEATH .................................................. 1361 Esteban Guillen ; Trilce Estrada ; Matthew Cain

RAMP: REAL-TIME ANOMALY DETECTION IN SCIENTIFIC WORKFLOWS .............................................. 1367 J. Dinal Herath ; Changxin Bai ; Guanhua Yan ; Ping Yang ; Shiyong Lu

FAE: A FAIRNESS-AWARE ENSEMBLE FRAMEWORK ...................................................................................... 1375 Vasileios Iosifidis ; Besnik Fetahu ; Eirini Ntoutsi

WALK2PRIVACY: LIMITING TARGET LINK PRIVACY DISCLOSURE AGAINST THE

ADVERSARIAL LINK PREDICTION ......................................................................................................................... 1381 Zhongyuan Jiang ; Jianfeng Ma ; Philip S. Yu

PRIVACY BARGAINING WITH FAIRNESS: PRIVACY-PRICE NEGOTIATION SYSTEM FOR

APPLYING DIFFERENTIAL PRIVACY IN DATA MARKET ENVIRONMENTS .............................................. 1389 Kangsoo Jung ; Seog Park

INDIRECT ADVERSARIAL ATTACKS VIA POISONING NEIGHBORS FOR GRAPH

CONVOLUTIONAL NETWORKS ............................................................................................................................... 1395 Tsubasa Takahashi

FAIRGAN+: ACHIEVING FAIR DATA GENERATION AND CLASSIFICATION THROUGH

GENERATIVE ADVERSARIAL NETS ....................................................................................................................... 1401 Depeng Xu ; Shuhan Yuan ; Lu Zhang ; Xintao Wu

FAST FREQUENT PATTERN MINING WITHOUT CANDIDATE GENERATIONS ON GPU BY

LOW LATENCY MEMORY ALLOCATION ............................................................................................................. 1407 Yu-Chen Wu ; Mi-Yen Yeh ; Tei-Wei Kuo

BIG DATA AND TRADITIONAL CHINESE MEDICINE (TCM): WHAT’S STATE OF THE ART? ................ 1417 David C. Mainenti

CHALLENGES WITH EXTREME CLASS-IMBALANCE AND TEMPORAL COHERENCE: A

STUDY ON SOLAR FLARE DATA .............................................................................................................................. 1423 Azim Ahmadzadeh ; Maxwell Hostetter ; Berkay Aydin ; Manolis K. Georgoulis ; Dustin J. Kempton ; Sushant S.

Mahajan ; Rafal Angryk

INTO THE REVERIE: EXPLORATION OF THE DREAM MARKET .................................................................. 1432 Theo Carr ; Jun Zhuang ; Dwight Sablan ; Emma Larue ; Yubao Wu ; Mohammad Al Hasan ; George Mohler

LEARNING TO DIFFERENTIATE BETWEEN MAIN-ARTICLES AND SUB-ARTICLES IN

WIKIPEDIA ..................................................................................................................................................................... 1442 Muhao Chen ; Changping Meng ; Gang Huang ; Carlo Zaniolo

ORIGIN-DESTINATION FLOW PREDICTION WITH VEHICLE TRAJECTORY DATA AND

SEMI-SUPERVISED RECURRENT NEURAL NETWORK ..................................................................................... 1450 Tao Huang ; Yintai Ma ; Zhiwei Tony Qin ; Jianfeng Zheng ; Henry X. Liu ; Hongtu Zhu ; Jieping Ye

CRIME-GAN: A CONTEXT-BASED SEQUENCE GENERATIVE NETWORK FOR CRIME

FORECASTING WITH ADVERSARIAL LOSS ......................................................................................................... 1460 Guangyin Jin ; Qi Wang ; Xia Zhao ; Yanghe Feng ; Qing Cheng ; Jincai Huang

RAIL TRACK QUALITY AND T-STOCHASTIC NEIGHBOR EMBEDDING FOR HYBRID

TRACK INDEX ............................................................................................................................................................... 1470 Ahmed Lasisi ; Antonio Merheb ; Allan Zarembski ; Nii Attoh-Okine

VIRTUAL BIG DATA FOR GAN BASED DATA AUGMENTATION .................................................................... 1478 Hadi Mansourifar ; Lin Chen ; Weidong Shi

EXASCALE DEEP LEARNING TO ACCELERATE CANCER RESEARCH ........................................................ 1488 Robert M. Patton ; J. Travis Johnston ; Steven R. Young ; Catherine D. Schuman ; Thomas E. Potok ; Derek C.

Rose ; Seung-Hwan Lim ; Junghoon Chae ; Le Hou ; Shahira Abousamra ; Dimitris Samaras ; Joel Saltz

TALE OF THREE STATES: ANALYSIS OF LARGE PERSON-TO-PERSON ONLINE

FINANCIAL TRANSACTIONS IN THREE BALTIC COUNTRIES........................................................................ 1497 Rajesh Sharma ; Artem Mateush ; Jaan Übi

REGULARIZED OPERATING ENVELOPE WITH INTERPRETABILITY AND

IMPLEMENTABILITY CONSTRAINTS .................................................................................................................... 1506 Qiyao Wang ; Haiyan Wang ; Chetan Gupta ; Susumu Serita

KERNEL-BASED MULTI-TASK CONTEXTUAL BANDITS IN CELLULAR NETWORK

CONFIGURATION ......................................................................................................................................................... 1517 Xiaoxiao Wang ; Xueying Guo ; Jie Chuai ; Zhitang Chen ; Xin Liu

ABR-HIC: ATTENTION BASED BIDIRECTIONAL RNN FOR HIERARCHICAL INDUSTRY

CLASSIFICATION ......................................................................................................................................................... 1527 Rongzhe Wei ; Qinghua Zheng ; Bo Dong ; Kuanzheng Yang ; Huan He ; Jianfei Ruan

LEARNING TO DISCOVER CURBSIDE PARKING SPACES FROM VEHICLE TRAJECTORIES ................ 1537 Yuxin Wen ; Jizhou Huang ; Chongli Zhu ; Miao Fan ; Ying Li

MSSTN: MULTI-SCALE SPATIAL TEMPORAL NETWORK FOR AIR POLLUTION

PREDICTION .................................................................................................................................................................. 1547 Zhiyuan Wu ; Yue Wang ; Lin Zhang

DSSLP: A DISTRIBUTED FRAMEWORK FOR SEMI-SUPERVISED LINK PREDICTION ............................. 1557 Dalong Zhang ; Xianzheng Song ; Ziqi Liu ; Zhiqiang Zhang ; Xin Huang ; Lin Wang ; Jun Zhou

TRANSLAND: AN ADVERSARIAL TRANSFER LEARNING APPROACH FOR MIGRATABLE

URBAN LAND USAGE CLASSIFICATION USING REMOTE SENSING ............................................................. 1567 Yang Zhang ; Ruohan Zong ; Jun Han ; Hao Zheng ; Qiuwen Lou ; Daniel Zhang ; Dong Wang

PLACE REPRESENTATION BASED BIKE DEMAND PREDICTION .................................................................. 1577 Yang Zhou ; Yan Huang

SCALING DEEP LEARNING MODELS FOR LARGE SPATIAL TIME-SERIES FORECASTING .................. 1587 Zainab Abbas ; Jon Reginbald Ivarsson ; Ahmad Al-Shishtawy ; Vladimir Vlassov

SIMILARITY HASHING FOR CHARGED PARTICLE TRACKING .................................................................... 1595 Sabrina Amrouche ; Tobias Golling ; Moritz Kiehn ; Claudia Plant ; Andreas Salzburger

A KING’S RANSOM FOR ENCRYPTION: RANSOMWARE CLASSIFICATION USING

AUGMENTED ONE-SHOT LEARNING AND BAYESIAN APPROXIMATION .................................................. 1601 Amir Atapour-Abarghouei ; Stephen Bonner ; Andrew Stephen McGough

STRATUM: A BIGDATA-AS-A-SERVICE FOR LIFECYCLE MANAGEMENT OF IOT

ANALYTICS APPLICATIONS ..................................................................................................................................... 1607 Anirban Bhattacharjee ; Yogesh Barve ; Shweta Khare ; Shunxing Bao ; Zhuangwei Kang ; Aniruddha Gokhale ;

Thomas Damiano

TILE & MERGE: DISTRIBUTED DELAUNAY TRIANGULATIONS FOR CLOUD COMPUTING ................. 1613 Laurent Caraffa ; Pooran Memari ; Murat Yirci ; Mathieu Brédif

OBJECTIVE SLEEP QUALITY AS A PREDICTOR OF MILD COGNITIVE IMPAIRMENT IN

SENIORS LIVING ALONE ........................................................................................................................................... 1619 Brian Chen ; Hwee-Pink Tan ; Iris Rawtaer ; Hwee-Xian Tan

BIG DATA ANALYTIC FOR CASCADING FAILURE ANALYSIS ........................................................................ 1625 Yousu Chen ; Tianzhixi Yin ; Renke Huang ; Xiaoyuan Fan ; Qiuhua Huang

MODELING HUMAN ATTENTION BY LEARNING FROM LARGE AMOUNT OF EMOTIONAL

IMAGES ........................................................................................................................................................................... 1631 Macario O. Cordel

ADAPTIVELY SELECTING OCCUPATIONS TO DETECT SKILL SHORTAGES FROM

ONLINE JOB ADS .......................................................................................................................................................... 1637 Nik Dawson ; Marian-Andrei Rizoiu ; Benjamin Johnston ; Mary-Anne Williams

REGRESSION PREDICTION FOR GEOLOCATION AWARE THROUGH RELATIVE DENSITY

RATIO ESTIMATION ................................................................................................................................................... 1644 Bo Dong ; Jinghui Guo ; Zhuoyi Wang ; Rong Wu ; Yang Gao ; Latifur Khan

VALIDATING THE USE OF WI-FI SIGNALS TO ESTIMATE HYPERLOCAL URBAN

POPULATIONS .............................................................................................................................................................. 1650 Nicholas E. Johnson ; Pablo Mandiola ; Cyrus Blankinship ; Bartosz Bonczak ; Constantine E. Kontokosta

OD-NETWORK-BASED PEDESTRIAN-PATH PREDICTION FOR PEOPLE-FLOW

SIMULATION ................................................................................................................................................................. 1656 Yu Kitano ; Satoshi Kuwamoto ; Akinori Asahara

AD2: IMPROVING QUALITY OF IOT DATA THROUGH COMPRESSIVE ANOMALY

DETECTION ................................................................................................................................................................... 1662 Aekyeung Moon ; Xiaoyan Zhuo ; Jialing Zhang ; Seung Woo Son

COMPLEX EVENT ANALYSIS FOR TRAFFIC RISK PREDICTION BASED ON 3D-CNN WITH

MULTI-SOURCES URBAN SENSING DATA ............................................................................................................ 1669 Ngoc-Thanh Nguyen ; Minh-Son Dao ; Koji Zettsu

UNSUPERVISED CONDITIONAL ADVERSARIAL NETWORKS FOR TAX EVASION

DETECTION ................................................................................................................................................................... 1675 Rongzhe Wei ; Bo Dong ; Qinghua Zheng ; Xulyu Zhu ; Jianfei Ruan ; Huan He

TEDM-PU: A TAX EVASION DETECTION METHOD BASED ON POSITIVE AND UNLABELED

LEARNING ...................................................................................................................................................................... 1681 Yingchao Wu ; Qinghua Zheng ; Yuda Gao ; Bo Dong ; Rongzhe Wei ; Fa Zhang ; Huan He

IMPLEMENTING A DOMAIN-INDEPENDENT FRAMEWORK TO DETECT SUSPICIOUS

REVIEW PATTERNS .................................................................................................................................................... 1687 Diwen Xue ; Willie Yee ; Yueping Wang ; Suzanne McIntosh

DETECTION OF ANOMALY STATE CAUSED BY UNEXPECTED ACCIDENT USING DATA OF

SMART CARD FOR PUBLIC TRANSPORTATION ................................................................................................. 1693 Sakura Yamaki ; Shou-De Lin ; Wataru Kameyama

DATA STREAMING ANALYSIS FRAMEWORK FOR THROUGH-TIME 3D FREE-BREATHING

LIVER DCE-MRI ............................................................................................................................................................ 1699 Kun Yang ; Xuhui Chen ; Pan Li

APPLICATION OF MULTI-CHANNEL 3D-CUBE SUCCESSIVE CONVOLUTION NETWORK

FOR CONVECTIVE STORM NOWCASTING ........................................................................................................... 1705 Wei Zhang ; Lei Han ; Juanzhen Sun ; Hanyang Guo ; Jie Dai

AD BLOCKING WHITELIST PREDICTION FOR ONLINE PUBLISHERS ......................................................... 1711 Shuai Zhao ; Achir Kalra ; Chong Wang ; Cristian Borcea ; Yi Chen

A CONGESTION DIFFUSION MODEL WITH INFLUENCE MAXIMIZATION FOR TRAFFIC

BOTTLENECKS IDENTIFICATION IN METROCITY SCALES ........................................................................... 1717 Baoxin Zhao ; Chengzhong Xu ; Siyuan Liu ; Juanjuan Zhao ; Li Li

ATTRIBUTED SEQUENCE EMBEDDING ................................................................................................................ 1723 Zhongfang Zhuang ; Xiangnan Kong ; Rundensteiner Elke ; Jihane Zouaoui ; Aditya Arora

FEDERATED MULTIMODAL BIG DATA STORAGE & ANALYTICS PLATFORM FOR

ADDITIVE MANUFACTURING .................................................................................................................................. 1729 Kareem S. Aggour ; Vijay S. Kumar ; Paul Cuddihy ; Jenny Weisenberg Williams ; Vipul Gupta ; Laura Dial ; Tim

Hanlon ; Justin Gambone ; Joseph Vinciquerra

DELOG: A HIGH-PERFORMANCE PRIVACY PRESERVING LOG FILTERING FRAMEWORK ................ 1739 Amey Agrawal ; Abhishek Dixit ; Namrata A Shettar ; Darshil Kapadia ; Vikram Agrawal ; Rajat Gupta ; Rohit

Karlupia

VERTICA FLATTENED TABLES AND LIVE AGGREGATE PROJECTIONS: A COLUMN-

BASED ALTERNATIVE TO MATERIALIZED VIEWS FOR ANALYTICS ......................................................... 1749 Yuanzhe Bei ; Thao Pham ; Akshay Aggarwal ; Nga Tran ; Jaimin Dave ; Chuck Bear ; Michael Leuchtenburg

VISUAL ANALYTICS FOR DEEP EMBEDDINGS OF LARGE SCALE MOLECULAR

DYNAMICS SIMULATIONS ........................................................................................................................................ 1759 Junghoon Chae ; Debsindhu Bhowmik ; Heng Ma ; Arvind Ramanathan ; Chad Steed

INFDETECT: A LARGE SCALE GRAPH-BASED FRAUD DETECTION SYSTEM FOR E-

COMMERCE INSURANCE .......................................................................................................................................... 1765 Cen Chen ; Chen Liang ; Jianbin Lin ; Li Wang ; Ziqi Liu ; Xinxing Yang ; Jun Zhou ; Yang Shuang ; Yuan Qi

SEQ2GRAPH: DISCOVERING DYNAMIC NON-LINEAR DEPENDENCIES FROM

MULTIVARIATE TIME SERIES ................................................................................................................................. 1774 Xuan-Hong Dang ; Syed Yousaf Shah ; Petros Zerfos

DATA-CENTRIC HELICOPTER FAILURE ANTICIPATION: THE MGB OIL PRESSURE

VIRTUAL SENSOR CASE ............................................................................................................................................ 1784 Nassia Daouayry ; Ammar Mechouche ; Pierre-Loic Maisonneuve ; Vasile-Marian Scuturici ; Jean-Marc Petit

SCALING UP HETEROGENEOUS WAVEFORM CLUSTERING FOR LONG-DURATION

MONITORING SIGNAL ACQUISITION, ANALYSIS, AND INTERACTION: BRIDGING BIG

DATA ANALYTICS WITH MEASUREMENT INSTRUMENT USAGE PATTERN ............................................. 1794 Masaharu Goto ; Naoki Kobayashi ; Gang Ren ; Mitsunori Ogihara

HIGH-RESOLUTION ROAD VEHICLE COLLISION PREDICTION FOR THE CITY OF

MONTREAL .................................................................................................................................................................... 1804 Antoine Hébert ; Timothée Guédon ; Tristan Glatard ; Brigitte Jaumard

TOWARDS POWER EFFICIENCY IN DEEP LEARNING ON DATA CENTER HARDWARE ......................... 1814 Miro Hodak ; Masha Gorkovenko ; Ajay Dholakia

LARGE DATA FLOW GRAPHS IN LIMITED GPU MEMORY ............................................................................. 1821 Geert Janssen ; Vladimir Zolotov ; Tung D. Le

INTELLIGENT EXPLORATION OF LARGE-SCALE DATA: WHAT CAN WE LEARN IN TWO

PASSES? ........................................................................................................................................................................... 1831 Chandrika Kamath

AN EFFICIENT CLOUD-BASED FRAMEWORK FOR DIGITAL MEDIA KNOWLEDGE

EXTRACTION ................................................................................................................................................................ 1841 Chaitanya Kanchibhotla ; Pruthviraj Venkatesh ; Dvln Somayajulu ; P Radha Krishna

CONTEXTUAL PRICE FEATURES FOR E-COMMERCE SEARCH RANKING ................................................ 1851 Ishita Khan ; Aritra Mandal ; Prathyusha Senthil Kumar

A FRAMEWORK FOR EXPLAINABLE TEXT CLASSIFICATION IN LEGAL DOCUMENT

REVIEW ........................................................................................................................................................................... 1858 Christian J. Mahoney ; Jianping Zhang ; Nathaniel Huber-Fliflet ; Peter Gronvall ; Haozhen Zhao

TARGETED DISPLAY ADVERTISING: THE CASE OF PREFERENTIAL ATTACHMENT ........................... 1868 Saurav Manchanda ; Pranjul Yadav ; Khoa Doan ; S. Sathiya Keerthi

MNP INSIDE OUT: A GAME THEORY ASSISTED MACHINE LEARNING MODEL TO

DETECT SUBSCRIBER CHURN BEHAVIORS UNDER CHINA’S MOBILE NUMBER

PORTABILITY POLICY ............................................................................................................................................... 1878 Ye Ouyang ; Aidong Yang ; Shuming Zeng ; Fanyu Meng

A DYNAMIC NEURAL NETWORK MODEL FOR CLICK-THROUGH RATE PREDICTION IN

REAL-TIME BIDDING .................................................................................................................................................. 1887 Xianshan Qu ; Li Li ; Xi Liu ; Rui Chen ; Yong Ge ; Soo-Hyun Choi

EDGEINSIGHT: CHARACTERIZING AND MODELING THE PERFORMANCE OF MACHINE

LEARNING INFERENCE ON THE EDGE AND CLOUD ........................................................................................ 1897 Philipp Ross ; Andre Luckow

HIGH VALUE CUSTOMER ACQUISITION & RETENTION MODELLING – A SCALABLE

DATA MASHUP APPROACH ...................................................................................................................................... 1907 Kajanan Sangaralingam ; Nisha Verma ; Aravind Ravi ; Su Won Bae ; Anindya Datta

MINING VESSEL TRAJECTORIES FOR ILLEGAL FISHING DETECTION ..................................................... 1917 Amir Yaghoubi Shahir ; Mohammad A. Tayebi ; Uwe Glässer ; Tilemachos Charalampous ; Zahra Zohrevand ;

Hans Wehn

REPRESENTATION LEARNING IN HETEROGENEOUS PROFESSIONAL SOCIAL

NETWORKS WITH AMBIGUOUS SOCIAL CONNECTIONS ............................................................................... 1928 Baoxu Shi ; Jaewon Yang ; Tim Weninger ; Jing How ; Qi He

THE CONCEPTUAL BACKGROUND OF OPTIMIST’S AI MODULE ................................................................. 1938 Sören Stöhrmann ; Vera Kamp ; Reinhard Moratz

FORWARD INDEX COMPRESSION FOR INSTANCE RETRIEVAL IN AN AUGMENTED

REALITY APPLICATION ............................................................................................................................................ 1946 Qi Wang ; Michal Siedlaczek ; Yen-Yu Chen ; Michael Gormish ; Torsten Suel

DEEPCAROTENE - JOB TITLE CLASSIFICATION WITH MULTI-STREAM

CONVOLUTIONAL NEURAL NETWORK ................................................................................................................ 1953 Jingya Wang ; Kareem Abdelfatah ; Mohammed Korayem ; Janani Balaji

AN ASSISTIVE LEARNING WORKFLOW ON ANNOTATING IMAGES FOR OBJECT

DETECTION ................................................................................................................................................................... 1962 Vivian Wen Hui Wong ; Max Ferguson ; Kincho H. Law ; Yung-Tsun Tina Lee

DEMYSTIFYING LEARNING RATE POLICIES FOR HIGH ACCURACY TRAINING OF DEEP

NEURAL NETWORKS .................................................................................................................................................. 1971 Yanzhao Wu ; Ling Liu ; Juhyun Bae ; Ka-Ho Chow ; Arun Iyengar ; Calton Pu ; Wenqi Wei ; Lei Yu ; Qi Zhang

FREQUENT CAUSAL PATTERN MINING: A COMPUTATIONALLY EFFICIENT

FRAMEWORK FOR ESTIMATING BIAS-CORRECTED EFFECTS .................................................................... 1981 Pranjul Yadav ; Michael Steinbach ; M. Regina Castro ; Pedro J. Caraballo ; Vipin Kumar ; Gyorgy Simon

ACE – AN ANOMALY CONTRIBUTION EXPLAINER FOR CYBER-SECURITY

APPLICATIONS ............................................................................................................................................................. 1991 Xiao Zhang ; Manish Marwah ; I-Ta Lee ; Martin Arlitt ; Dan Goldwasser

STREETIFY: USING STREET VIEW IMAGERY AND DEEP LEARNING FOR URBAN

STREETS DEVELOPMENT ......................................................................................................................................... 2001 Fahad Alhasoun ; Marta González

SUSPICIOUS LOCATION DETECTION USING TRAJECTORY ANALYSIS & LOCATION

BACKFILLING – A SCALABLE APPROACH .......................................................................................................... 2007 Su Bae ; Aravind Ravi ; Kajanan Sangaralingam ; Nisha Verma ; Anindya Datta ; Varun Chugh

STREAMLINED AND ACCELERATED CYBER ANALYST WORKFLOWS WITH CLX AND

RAPIDS ............................................................................................................................................................................ 2011 Nicholas Becker ; Ayush Dattagupta ; Eli Fajardo ; Prem Gali ; Bianca Rhodes ; Bartley Richardson ; Bhargav

Suryadevara

A SYSTEM FOR ANALYSIS AND REMEDIATION OF ATTRITION ................................................................... 2016 Neil Brockett ; Catriona Clarke ; Michele Berlingerio ; Sourav Dutta

AN INTERACTIVE ONLINE AUDIENCE EXTENSION SYSTEM ........................................................................ 2020 Yu Cheng ; Shuo Yang ; David Araujo ; Yibo Yao ; Yantao Yao

TSOUTLIER: EXPLAINING OUTLIERS WITH UNIFORM PROFILES OVER IOT DATA ............................. 2024 Ruihong Huang ; Zhiwei Chen ; Zhicheng Liu ; Shaoxu Song ; Jianmin Wang

MATRICS: A SYSTEM FOR HUMAN-MACHINE HYBRID FORECASTING OF

GEOPOLITICAL EVENTS ........................................................................................................................................... 2028 David J. Huber ; Sam Johnson ; Nigel Stepp ; Aruna Jammalamadaka ; Dana Warmsley ; Tiffany Kim ; Tsai-

Ching Lu

DEEP TRANSFER LEARNING FOR THERMAL DYNAMICS MODELING IN SMART

BUILDINGS ..................................................................................................................................................................... 2033 Zhanhong Jiang ; Young M. Lee

EMPIRICAL COMPARISONS OF CNN WITH OTHER LEARNING ALGORITHMS FOR TEXT

CLASSIFICATION IN LEGAL DOCUMENT REVIEW ........................................................................................... 2038 Robert Keeling ; Rishi Chhatwal ; Nathaniel Huber-Fliflet ; Jianping Zhang ; Fusheng Wei ; Haozhen Zhao ; Ye

Shi ; Han Qin

ENTERPRISE-WIDE MACHINE LEARNING USING TERADATA VANTAGE: AN

INTEGRATED ANALYTICS PLATFORM ................................................................................................................. 2043 Choudur Lakshminarayan ; Thiagarajan Ramakrishnan ; Awny Al-Omari ; Khaled Bouaziz ; Faraz Ahmad ; Sri

Raghavan ; Prama Agarwal

INDUSTRIAL TRACK: ARCHITECTING RAILWAY KPIS DATA PROCESSING WITH BIG

DATA TECHNOLOGIES ............................................................................................................................................... 2047 Alexander Suleykin ; Peter Panfilov ; Natalya Bakhtadze

NEAREST NEIGHBOR SUBSEQUENCE SEARCH IN TIME SERIES DATA ...................................................... 2057 Ramoza Ahsan ; Muzammil Bashir ; Rodica Neamtu ; Elke A. Rundensteiner ; Gabor Sarkozy

EFFECTS OF DATA REDUCTION METHODS AND RATES ON CLASSIFIERS ............................................... 2067 Reham M. Alamro ; Abdou S. Youssef

PROBABILISTIC NAMED ENTITY RECOGNITION FOR NON-STANDARD FORMAT

ENTITIES USING CO-OCCURRENCE WORD EMBEDDINGS ............................................................................. 2077 Jabir Alshehabi Al-Ani ; Maria Fasli

THE DYNAMIC-FPM: AN APPROACH FOR IDENTIFYING EVENTS FROM SOCIAL

NETWORKS USING FREQUENT PATTERN MINING AND DYNAMIC SUPPORT VALUES ........................ 2087 Nora Alkhamees ; Maria Fasli

DEEP LEARNING FOR NON-STATIONARY MULTIVARIATE TIME SERIES FORECASTING .................. 2097 Manal Almuammar ; Maria Fasli

POPULAR SEARCH TERMS AND STOCK PRICE PREDICTION ........................................................................ 2107 Izzat Alsmadi ; Muhammad Al-Abdullah ; Hisham Alsmadi

A ROBUST, EFFICIENT, AND BALANCED PARALLEL ALGORITHM FOR FINDING

CONNECTED COMPONENTS .................................................................................................................................... 2113 M. Asokan

INCREMENTAL ANALYSIS OF LARGE-SCALE SYSTEM LOGS FOR ANOMALY DETECTION ............... 2119 Merve Astekin ; Selim Özcan ; Hasan Sözer

QOS PREDICTION METHODS IN IOT A SURVEY ................................................................................................. 2128 Beyza Bagiröz ; Metehan Güzel ; Uraz Yavanoglu ; Suat Özdemir

ATM WITHDRAWAL AMOUNT FORECASTING THROUGH NEURAL ARCHITECTURES ........................ 2134 Orhun Bugra Baran ; Saim Sunel ; Pinar Karagoz ; Ismail Hakki Toroslu

ANALYZING THE CASCADING EFFECT OF TRAFFIC CONGESTION USING LSTM

NETWORKS .................................................................................................................................................................... 2144 Sanchita Basak ; Abhishek Dubey ; Leao Bruno

A RULE-BASED HOLISTIC APPROACH FOR TURKISH ASPECT-BASED SENTIMENT

ANALYSIS ....................................................................................................................................................................... 2154 Kivanc Bayraktar ; Uraz Yavanoglu ; Alper Ozbilen

A BI-OBJECTIVE APPROACH FOR PRODUCT RECOMMENDATIONS .......................................................... 2159 Idir Benouaret ; Sihem Amer-Yahia ; Christiane Kamdem-Kengne ; Jalil Chagraoui

WORDPREP: WORD-BASED PREPOSITION PREDICTION TOOL .................................................................... 2169 Pooja Bhagat ; Aparna S. Varde ; Anna Feldman

ON GENERATING DOMINATORS OF CUSTOMER PREFERENCES ................................................................ 2177 Jiang Bian ; Weibo Wang ; Xiang Zhang ; Wei Wang ; Arthur Huang ; Zhishan Guo

ATTENTION-BASED MULTI-TASK LEARNING FOR SENSOR ANALYTICS .................................................. 2187 Yujing Chen ; Huzefa Rangwala

HYPERPARAMETER ESTIMATION IN SVM WITH GPU ACCELERATION FOR PREDICTION

OF PROTEIN-PROTEIN INTERACTIONS ................................................................................................................ 2197 Huaming Chen ; Lei Wang ; Yaochu Jin ; Chi-Hung Chi ; Fucun Li ; Huaiyuan Chu ; Jun Shen

MULTI-TIME-HORIZON TRAFFIC RISK PREDICTION USING SPATIO-TEMPORAL URBAN

SENSING DATA FUSION .............................................................................................................................................. 2205 Minh-Son Dao ; Ngoc-Thanh Nguyen ; Koji Zettsu

SENTIMENT ANALYSIS IN TURKISH WITH DEEP LEARNING ........................................................................ 2215 Gözde Merve Demirci ; Seref Recep Keskin ; Gülüstan Dogan

A DATA-DRIVEN APPROACH TO KINEMATIC ANALYTICS OF SPINAL MOTION .................................... 2222 Aysenur Gençdogmus ; Seref Recep Keskin ; Gülüstan Dogan ; Yusuf Öztürk

PRACTICAL MACHINE LEARNING APPROACH TO CAPTURE THE SCHOLAR DATA

DRIVEN ALPHA IN AI INDUSTRY ............................................................................................................................ 2230 Yunzhe Fang ; Xiao-Yang Liu ; Hongyang Yang

RECURRENT NEURAL NETWORK-ASSISTED ADAPTIVE SAMPLING FOR APPROXIMATE

COMPUTING .................................................................................................................................................................. 2240 Yi Feng ; Yi Zhou ; Vahid Tarokh

SPATIO-TEMPORAL EVENT DETECTION USING POISSON MODEL AND QUAD-TREE ON

GEOTAGGED SOCIAL MEDIA .................................................................................................................................. 2247 Yasmeen George ; Shanika Karunasekera ; Aaron Harwood ; Kwan Hui Li

FAST SELF-ORGANIZING MAPS TRAINING ......................................................................................................... 2257 Flavio Giobergia ; Elena Baralis

PREDICTIVE MODELING OF CAMPAIGNS TO QUANTIFY PERFORMANCE IN FASHION

RETAIL INDUSTRY ...................................................................................................................................................... 2267 Chandadevi Giri ; Ulf Johansson ; Tuwe Löfström

UNDERSTANDING MULTILINGUAL COMMUNITIES THROUGH ANALYSIS OF CODE-

SWITCHING BEHAVIORS IN SOCIAL MEDIA DISCUSSIONS ........................................................................... 2274 Aaron Harwood ; Shanika Karunasekera ; Michelle Vanni ; Lucia Falzon ; Prarthana Padia ; Amila Silva

INFLUENCE OF NEIGHBORHOOD ON THE PREFERENCE OF AN ITEM IN ECOMMERCE

SEARCH ........................................................................................................................................................................... 2284 Saratchandra Indrakanti ; Svetlana Strunjas ; Shubhangi Tandon ; Manojkumar Kannadasan

CORAD: CORRELATION-AWARE COMPRESSION OF MASSIVE TIME SERIES USING

SPARSE DICTIONARY CODING ................................................................................................................................ 2289 Abdelouahab Khelifati ; Mourad Khayati ; Philippe Cudré-Mauroux

AUGMENTING CO-TRAINING WITH RECOMMENDATIONS TO CLASSIFY HUMAN RIGHTS

VIOLATIONS .................................................................................................................................................................. 2299 Ragini Kihlman ; Maria Fasli

TIME-VARYING ITEM FEATURE CONDITIONAL VARIATIONAL AUTOENCODER FOR

COLLABORATIVE FILTERING ................................................................................................................................. 2309 Jeeyung Kim

MAGNITUDE AND UNCERTAINTY PRUNING CRITERION FOR NEURAL NETWORKS ............................ 2317 Vinnie Ko ; Stefan Oehmcke ; Fabian Gieseke

DEEP LEARNING BASED DELAY AND BANDWIDTH EFFICIENT DATA TRANSMISSION IN

IOT .................................................................................................................................................................................... 2327 Ibrahim Kök ; Burak H. Çorak ; Uraz Yavanoglu ; Suat Özdemir

ACTIVE LEARNING WITH ABSTAINING CLASSIFIERS FOR IMBALANCED DRIFTING

DATA STREAMS ............................................................................................................................................................ 2334 Lukasz Korycki ; Alberto Cano ; Bartosz Krawczyk

TACKLING THE NEIGHBORING NETWORK HIT PROBLEM IN CELLULAR DATA .................................. 2344 Andrés Leiva-Araos ; Héctor Allende-Cid ; Denis Khryashchev ; Huy T. Vo

EXPLAINABLE VISUALIZATION FOR INTERACTIVE EXPLORATION OF CNN ON

WIKIPEDIA VANDAL DETECTION .......................................................................................................................... 2354 Zerong Liu ; Aidong Lu

A SERP-MINING APPROACH FOR CLASSIFICATION OF DNS REQUESTS ................................................... 2364 Junlan Lu ; Nikhil Takappa Saunshi ; Aldrich Mangune ; Magdalini Eirinaki ; Bin Yu ; Cricket Liu

DAOC: STABLE CLUSTERING OF LARGE NETWORKS ..................................................................................... 2373 Artem Lutov ; Mourad Khayati ; Philippe Cudré-Mauroux

MERGING LARGE ONTOLOGIES USING BIGDATA GRAPHDB ....................................................................... 2383 Kurosh Madani ; Cristiano Russo ; Antonio M. Rinaldi

TOWARDS COMPARING AND USING MACHINE LEARNING TECHNIQUES FOR

DETECTING AND PREDICTING HEART ATTACK AND DISEASES ................................................................. 2393 Thankgod Obasi ; M. Omair Shafiq

DETECTING HARDLY VISIBLE ROADS IN LOW-RESOLUTION SATELLITE TIME SERIES

DATA ................................................................................................................................................................................ 2403 Stefan Oehmcke ; Christoffer Thrysøe ; Andreas Borgstad ; Marcos Antonio Vaz Salles ; Martin Brandt ; Fabian

Gieseke

NEURAL AND QUANTUM CRYPTOGRAPHY IN BIG DATA: A REVIEW ........................................................ 2413 Burak Özçakmak ; Alper Özbilen ; Uraz Yavanoglu ; Kübra Cin

HELP ONESELF IN HELPING THE OTHERS: THE ECOLOGY OF ONLINE SUPPORT

GROUPS ........................................................................................................................................................................... 2418 Shuaidong Pan ; Tianran Hu ; Shujing Sun ; Jianbo Yuan ; Jiebo Luo

AN INFORMATIVE BASE OF POSITIVE AND NEGATIVE ASSOCIATION RULES ON BIG

DATA ................................................................................................................................................................................ 2428 Bemarisika Parfait ; Totohasina André

END-TO-END JOINT OPINION ROLE LABELING WITH BERT ......................................................................... 2438 Wei Quan ; Jinli Zhang ; Xiaohua Tony Hu

ELIMINATING DATA COLLECTION BOTTLENECK FOR WAKE WORD ENGINE TRAINING

USING FOUND AND SYNTHETIC DATA.................................................................................................................. 2447 Buvaneswari Ramanan ; Lawrence Drabeck ; Thomas Woo ; Troy Cauble ; Anil Rana

TOWARDS ANALYZING LARGE GRAPHS WITH QUANTUM ANNEALING .................................................. 2457 Hannu Reittu ; Ville Kotovirta ; Lasse Leskelä ; Hannu Rummukainen ; Tomi Räty

REVISITING TEXT AND KNOWLEDGE GRAPH JOINT EMBEDDINGS: THE AMOUNT OF

SHARED INFORMATION MATTERS! ...................................................................................................................... 2465 Paolo Rosso ; Dingqi Yang ; Philippe Cudré-Mauroux

LEARNING TEMPORAL RELEVANCE IN LONGITUDINAL MEDICAL NOTES ............................................ 2474 Cansu Sen ; Thomas Hartvigsen ; Xiangnan Kong ; Elke Rundensteiner

BAGGING USING INSTANCE-LEVEL DIFFICULTY FOR MULTI-CLASS IMBALANCED BIG

DATA CLASSIFICATION ON SPARK ........................................................................................................................ 2484 William C. Sleeman IV ; Bartosz Krawczyk

NET PROMOTER SENTIMENT CLASSIFIER USING OHPL-ALL ...................................................................... 2494 Bob Vanderheyden ; Ying Xie ; Mohan Rachumallu

AWDF: AN ADAPTIVE WEIGHTED DEEP FUSION ARCHITECTURE FOR MULTI-

MODALITY LEARNING ............................................................................................................................................... 2503 Qinghan Xue ; Abhishek Kolagunda ; Steven Eliuk ; Xiaolong Wang

VIDCEP: COMPLEX EVENT PROCESSING FRAMEWORK TO DETECT SPATIOTEMPORAL

PATTERNS IN VIDEO STREAMS ............................................................................................................................... 2513 Piyush Yadav ; Edward Curry

WOTBOOST: WEIGHTED OVERSAMPLING TECHNIQUE IN BOOSTING FOR IMBALANCED

LEARNING ...................................................................................................................................................................... 2523 Wenhao Zhang ; Ramin Ramezani ; Arash Naeim

TRUST INFERENCE FOR RIDESHARE THROUGH CO-TRAINING ON SOCIAL MEDIA DATA ................ 2532 Yang Zhou ; Yan Huang ; Joseph McGlynn ; Alexander Han

INFER LATENT PRIVACY FOR ATTRIBUTE NETWORK IN KNOWLEDGE GRAPH .................................. 2542 Zeyuan Cui ; Li Pan ; Shijun Liu ; Lizhen Cui

PRIVACY-PRESERVING HETEROGENEOUS FEDERATED TRANSFER LEARNING ................................... 2552 Dashan Gao ; Yang Liu ; Anbu Huang ; Ce Ju ; Han Yu ; Qiang Yang

SGNN: A GRAPH NEURAL NETWORK BASED FEDERATED LEARNING APPROACH BY

HIDING STRUCTURE ................................................................................................................................................... 2560 Guangxu Mei ; Ziyu Guo ; Shijun Liu ; Li Pan

SECURE AND EFFICIENT FEDERATED TRANSFER LEARNING ..................................................................... 2569 Shreya Sharma ; Chaoping Xing ; Yang Liu ; Yan Kang

PROFIT ALLOCATION FOR FEDERATED LEARNING ....................................................................................... 2577 Tianshu Song ; Yongxin Tong ; Shuyue Wei

FEDERATED LEARNING WITH BAYESIAN DIFFERENTIAL PRIVACY ......................................................... 2587 Aleksei Triastcyn ; Boi Faltings

MEASURE CONTRIBUTION OF PARTICIPANTS IN FEDERATED LEARNING ............................................. 2597 Guan Wang ; Charlie Xiaoqian Dang ; Ziye Zhou

POWER DEMAND RESPONSE INCENTIVE PRICING MODEL .......................................................................... 2605 Kun Zhang ; Yuliang Shi ; Yuecan Liu ; Zhongmin Yan

RADICALLY SIMPLIFYING GATED RECURRENT ARCHITECTURES WITHOUT LOSS OF

PERFORMANCE ............................................................................................................................................................ 2615 Jonathan W. Boardman ; Ying Xie

COARSE GRAINING OF DATA VIA INHOMOGENEOUS DIFFUSION CONDENSATION ............................. 2624 Nathan Brugnone ; Alex Gonopolskiy ; Mark W. Moyle ; Manik Kuchroo ; David Van Dijk ; Kevin R. Moon ;

Daniel Colon-Ramos ; Guy Wolf ; Matthew J. Hirn ; Smita Krishnaswamy

FINDING ARCHETYPAL SPACES USING NEURAL NETWORKS ...................................................................... 2634 David Van Dijk ; Daniel B. Burkhardt ; Matthew Amodio ; Alexander Tong ; Guy Wolf ; Smita Krishnaswamy

UTILITY-DRIVEN MINING OF HIGH UTILITY EPISODES ................................................................................. 2644 Wensheng Gan ; Jerry Chun-Wei Lin ; Han-Chieh Chao ; Philip S. Yu

MINING FREQUENT TEMPORAL PATTERNS FROM MEDICAL DATA BASED ON FUZZY

RANGED RELATIONS .................................................................................................................................................. 2654 Shoji Hirano ; Shusaku Tsumoto

MINING TEMPORAL FUZZY UTILITY ITEMSETS BY TREE STRUCTURE ................................................... 2659 Tzung-Pei Hong ; Cheng-Yu Lin ; Wei-Ming Huang ; Shu-Min Li ; Shyue-Liang Wang ; Jerry Chun-Wei Lin

STUDY OF THE EFFECTS OF VISUAL COMPLEXITY AND CONSUMER EXPERIENCE ON

VISUAL ATTENTION AND PURCHASE BEHAVIOR THROUGH THE USE OF EYE

TRACKING ..................................................................................................................................................................... 2664 Ken Ishibashi ; Chen Xiao ; Katsutoshi Yada

MINING HIGH-UTILITY SEQUENTIAL PATTERNS FROM BIG DATASETS .................................................. 2674 Jerry Chun-Wei Lin ; Yuanfa Li ; Philippe Fournier-Viger ; Youcef Djenouri ; Leon Shyue-Liang Wang

BRIDGING THE GAP BETWEEN COMMUNITY AND NODE REPRESENTATIONS: GRAPH

EMBEDDING VIA COMMUNITY DETECTION ...................................................................................................... 2681 Artem Lutov ; Dingqi Yang ; Philippe Cudré-Mauroux

AN EFFECTIVE AND SCALABLE DATA MODELING FOR ENTERPRISE BIG DATA

PLATFORM .................................................................................................................................................................... 2691 Jayesh Patel

ESTIMATION OF DISEASE CODE FROM ELECTRONIC PATIENT RECORDS ............................................. 2698 Shusaku Tsumoto ; Tomohiro Kimura ; Haruko Iwata ; Shoji Hirano

A GA-BASED FRAMEWORK FOR MINING HIGH FUZZY UTILITY ITEMSETS ............................................ 2708 Jimmy Ming-Tai Wu ; Jerry Chun-Wei Lin ; Philippe Fournier-Viger ; Tomasz Wiktorski ; Tzung-Pei Hong ; Matin

Pirouz

A DATA-DRIVEN APPROACH FOR CONTINUOUS ADHERENCE PREDICTIONS IN SLEEP

APNEA THERAPY MANAGEMENT .......................................................................................................................... 2716 Matheus Araujo ; Louis Kazaglis ; Conrad Iber ; Jaideep Srivastava

THE SERUMS TOOL-CHAIN: ENSURING SECURITY AND PRIVACY OF MEDICAL DATA IN

SMART PATIENT-CENTRIC HEALTHCARE SYSTEMS ...................................................................................... 2726 V. Janjic ; J. K. F. Bowles ; A. F. Vermeulen ; A. Silvina ; M. Belk ; C. Fidas ; A. Pitsillides ; M. Kumar ; M.

Rossbory ; M. Vinov ; T. Given-Wilson ; A. Legay ; E. Blackledge ; R. Arredouani ; G. Stylianou ; W. Huang

CLASSIFICATION MODELS AND SURVIVAL ANALYSIS FOR PROSTATE CANCER USING

RNA SEQUENCING AND CLINICAL DATA ............................................................................................................ 2736 Md Faisal Kabir ; Simone A. Ludwig

REGIONAL ANALYSIS OF DEATH RATE DUE TO AIR POLLUTION IN TURKEY AND ITS

NEIGHBORS ................................................................................................................................................................... 2746 Yunus Emre Karazag ; Sultan Turhan ; Ozgun Pinarer ; A. Teoman Naskali

USING HOSPITAL ADMINISTRATIVE DATA TO INFER PATIENT-PATIENT CONTACT VIA

THE CONSISTENT CO-PRESENCE ALGORITHM................................................................................................. 2756 Jeffrey Lienert ; Felix Reed-Tsochas ; Laura Koehly ; Christopher Steven Marcum

ANALYZING PUBLIC OUTLOOK TOWARDS VACCINATION USING TWITTER ......................................... 2763 Rutuja Mahajan ; William Romine ; Michele Miller ; Tanvi Banerjee

A HYBRID MODEL USING LSTM AND DECISION TREE FOR MORTALITY PREDICTION

AND ITS APPLICATION IN PROVIDER PERFORMANCE EVALUATION ....................................................... 2773 Peichang Shi ; Aryya Gangopadhyay ; Carolyn Owens ; Brenda Blunt ; Christine Grogan

SINGLE-CELL REGULATORY NETWORK INFERENCE AND CLUSTERING FROM HIGH-

DIMENSIONAL SEQUENCING DATA ....................................................................................................................... 2782 Aristidis G. Vrahatis ; Georgios N. Dimitrakopoulos ; Sotiris K. Tasoulis ; Spiros V. Georgakopoulos ; Vassilis P.

Plagianakos

THE DEVELOPMENT OF MACHINE LEARNING INFUSED OUTPATIENT PROGNOSTIC

MODELS FOR TACKLING IMPACTS OF CLIMATE CHANGE AND ENSURING DELIVERY

OF EFFECTIVE POPULATION HEALTH SERVICES ............................................................................................ 2790 Jaya Shankar Vuppalapati ; Santosh Kedari ; Anitha Ilapakurti ; Chandrasekar Vuppalapati ; Sharat Kedari ;

Rajasekar Vuppalapati

ENHANCING CLINICAL INFORMATION RETRIEVAL THROUGH CONTEXT-AWARE

QUERIES AND INDICES .............................................................................................................................................. 2800 Andrew Wen ; Yanshan Wang ; Vinod C. Kaggal ; Sijia Liu ; Hongfang Liu ; Jungwei Fan

TRANSPOSE-BASED INTEGRATED DATA REDUCTION TECHNIQUES FOR SPEEDING UP

CLASSIFIER TRAINING .............................................................................................................................................. 2808 Reham M. Alamro ; Abdou S. Youssef

HIERARCHICAL TRANSFER CONVOLUTIONAL NEURAL NETWORKS FOR IMAGE

CLASSIFICATION ......................................................................................................................................................... 2817 Xishuang Dong ; Hsiang-Huang Wu ; Yuzhong Yan ; Lijun Qian

RECONSTRUCTION OF AGENTS’ CORRUPTED TRAJECTORIES OF COLLECTIVE MOTION

USING LOW-RANK MATRIX COMPLETION ......................................................................................................... 2826 Kelum Gajamannage ; Randy Paffenroth

AUTONOMIC WORKLOAD CHANGE CLASSIFICATION AND PREDICTION FOR BIG DATA

WORKLOADS ................................................................................................................................................................ 2835 Mikhail Genkin ; Frank Dehne

PARALLEL GRADIENT BOOSTING BASED GRANGER CAUSALITY LEARNING ....................................... 2845 Pei Guo ; Chen Liuy ; Yan Tang ; Jianwu Wang

MODELLING WEALTH FROM CALL DETAIL RECORDS AND SURVEY DATA WITH

MACHINE LEARNING: EVIDENCE FROM PAPUA NEW GUINEA ................................................................... 2855 Muhammad Rizal Khaefi ; Hendrik ; Dharani Dhar Burra ; Rio Fandi Dianco ; Dikara Maitri Pradipta Alkarisya ;

Muhammad Rheza Muztahid ; Annissa Zahara ; George Hodge ; Rajius Idzalika

ONE-PASS SPARSIFIED GAUSSIAN MIXTURES ................................................................................................... 2865 Eric Kightley ; Stephen Becker

PERSONALIZED DEEPINF: ENHANCED SOCIAL INFLUENCE PREDICTION WITH DEEP

LEARNING AND TRANSFER LEARNING ................................................................................................................ 2871 Carson K. Leung ; Alfredo Cuzzocrea ; Jiaxing Jason Mai ; Deyu Deng ; Fan Jiang

VIDANOMALY: LSTM-AUTOENCODER-BASED ADVERSARIAL LEARNING FOR ONE-

CLASS VIDEO CLASSIFICATION WITH MULTIPLE DYNAMIC IMAGES ...................................................... 2881 Shusheng Li ; Wenbo He

A NOVEL OVERSAMPLING METHOD BASED ON SEQGAN FOR IMBALANCED TEXT

CLASSIFICATION ......................................................................................................................................................... 2891 Yin Luo ; Haishan Feng ; Xuanlong Weng ; Ke Huang ; Huang Zheng

ATTENTION-BASED MULTI-LAYER CHINESE WORD EMBEDDING ............................................................. 2895 Bing Ma ; Haifeng Sun ; Jingyu Wang ; Qi Qi

SWAG: ITEM RECOMMENDATIONS USING CONVOLUTIONS ON WEIGHTED GRAPHS ........................ 2903 Amit Pande ; Kai Ni ; Venkataramani Kini

DQA: SCALABLE, AUTOMATED AND INTERACTIVE DATA QUALITY ADVISOR ..................................... 2913 Shrey Shrivastava ; Dhaval Patel ; Anuradha Bhamidipaty ; Wesley M. Gifford ; Stuart A. Siegel ; Venkata

Sitaramagiridharganesh Ganapavarapu ; Jayant R. Kalagnanam

RECOVERY-ORIENTED BIG DATA COMPUTING FOR EXACTLY ONCE MESSAGE

PROCESSING ................................................................................................................................................................. 2923 Fangchen Sun ; Feng Mao ; Stephen Guo ; Yitao Yao ; Paritosh Malaviya ; Kushal Bhatt ; Anurag Gupta ; Mridul

Jain ; Kannan Achan ; Xiaotong Suo ; Nishad Kamat

AVERAGING ENSEMBLES MODEL FOR FORECASTING OF SHORT-TERM LOAD IN

SMART GRIDS ............................................................................................................................................................... 2931 Dabeeruddin Syed ; Shady S. Refaat ; Haitham Abu-Rub ; Othmane Bouhali ; Ameema Zainab ; Le Xie

Q-LEARNING WITH KALMAN FILTERS ................................................................................................................ 2939 Kei Takahata ; Takao Miura

INFORMED TRADING SUPPORT FOR THE AMATEUR INVESTORON THE NEW YORK

STOCK EXCHANGE ..................................................................................................................................................... 2948 T. Warren De Wit ; Vineetha Menon

RECCITE: A HYBRID APPROACH TO RECOMMEND POTENTIAL PAPERS ................................................ 2956 Pratyush Yadav ; Nikhila Remala ; Nargis Pervin

VIRTUAL-LINK REPRESENTATION FOR LINK PREDICTION ......................................................................... 2965 Can Yao ; Hai Huang ; Longxuan Ma ; Ze Yang ; Lei Zhang

FAULTED LINE IDENTIFICATION AND LOCALIZATION IN POWER SYSTEM USING

MACHINE LEARNING TECHNIQUES ...................................................................................................................... 2975 Ameema Zainab ; Shady S. Refaat ; Dabeeruddin Syed ; Ali Ghrayeb ; Haitham Abu-Rub

AN INNOVATIVE ONLINE PROCESS MINING FRAMEWORK FOR SUPPORTING

INCREMENTAL GDPR COMPLIANCE OF BUSINESS PROCESSES .................................................................. 2982 Rashid Zaman ; Alfredo Cuzzocrea ; Marwan Hassani

MULTI-TASK DEEP REINFORCEMENT LEARNING FOR SCALABLE PARALLEL TASK

SCHEDULING ................................................................................................................................................................ 2992 Lingxin Zhang ; Qi Qi ; Jingyu Wang ; Haifeng Sun ; Jianxin Liao

IDENTIFYING IMPORTANT INTERNET OUTAGES ............................................................................................. 3002 Ryan Bogutz ; Yuri Pradkin ; John Heidemann

QUANTIFYING THE EFFECTS OF RECOMMENDATION SYSTEMS ............................................................... 3008 Sunshine Chong ; Andrés Abeliuk

COMPUTING AS CURRICULAR PERIPHERAL ..................................................................................................... 3016 Shanni Lam ; Jacob Van Der Leeuw ; Wayne Ying ; Zachary Dodds ; Henry Coxe

DETECTING FAKE NEWS ARTICLES ...................................................................................................................... 3021 Jun Lin ; Glenna Tremblay-Taylor ; Guanyi Mou ; Di You ; Kyumin Lee

EFFECTS OF NETWORK STRUCTURE ON SUBJECTIVE PREFERENCE DIVERSITY ................................ 3026 Anne Lin ; Andrés Abeliuk ; Emilio Ferrara

SOMEF: A FRAMEWORK FOR CAPTURING SCIENTIFIC SOFTWARE METADATA FROM

ITS DOCUMENTATION ............................................................................................................................................... 3032 Allen Mao ; Daniel Garijo ; Shobeir Fakhraei

LOW COST GUNSHOT DETECTION USING DEEP LEARNING ON THE RASPBERRY PI ........................... 3038 Alex Morehead ; Lauren Ogden ; Gabe Magee ; Ryan Hosler ; Bruce White ; George Mohler

GROUP LINK PREDICTION ....................................................................................................................................... 3045 Andrew Stanhope ; Hao Sha ; Danielle Barman ; Mohammad Al Hasan ; George Mohler

PAPER RECOMMENDATION BASED ON CITATION RELATION ..................................................................... 3053 William Tanner ; Esra Akbas ; Mir Hasan

ARCHCONTRACT: USING SMART CONTRACTS FOR DISPOSITION ............................................................. 3060 Danielle Alves Batista ; Tim Weingaertner

TOWARDS A FLEXIBLE SYSTEM ARCHITECTURE FOR AUTOMATED KNOWLEDGE BASE

CONSTRUCTION FRAMEWORKS ............................................................................................................................ 3066 Osman Din

AUTOMATED INTERPRETABILITY OF LINKED DATA ONTOLOGIES: AN EVALUATION

WITHIN THE CULTURAL HERITAGE DOMAIN ................................................................................................... 3072 Nuno Freire ; Sjors De Valk

BUILDING THE NATIONAL RADIO RECORDINGS DATABASE: A BIG DATA APPROACH TO

DOCUMENTING AUDIO HERITAGE ........................................................................................................................ 3080 Emily Goodmann ; Mark A. Matienzo ; Shawn Vancour ; William Vanden Dries

EXTENDING THE SCOPE OF COMPUTATIONAL ARCHIVAL SCIENCE: A CASE STUDY ON

LEVERAGING ARCHIVAL AND ENGINEERING APPROACHES TO DEVELOP A

FRAMEWORK TO DETECT AND PREVENT “FAKE VIDEO” ............................................................................ 3087 Hoda Hamouda ; Jessica Bushey ; Victoria Lemieux ; James Stewart ; Corinne Rogers ; James Cameron ; Ken

Thibodeau ; Chen Feng

USING DATA PARTITIONS AND STATELESS SERVERS TO SCALE UP FEDORA

REPOSITORIES ............................................................................................................................................................. 3098 Gregory Jansen ; Aaron Coburn ; Adam Soroka ; Richard Marciano

DIGITAL LEGACIES ON PAPER: READING PUNCHCARDS WITH COMPUTER VISION ........................... 3103 Gregory Jansen

PRELIMINARY ANALYSIS OF A LARGE-SCALE DIGITAL ENTERTAINMENT

DEVELOPMENT ARCHIVE: A CASE STUDY OF THE ENTERTAINMENT TECHNOLOGY

CENTER’S PROJECTS ................................................................................................................................................. 3109 Eric Kaltman

ENTERPRISE ARCHITECTURE — A VALUE PROPOSITION FOR RECORDS

PROFESSIONALS .......................................................................................................................................................... 3116 Shadrack Katuu

REFRAMING DIGITAL CURATION PRACTICES THROUGH A COMPUTATIONAL

THINKING FRAMEWORK .......................................................................................................................................... 3126 Richard Marciano ; Sarah Agarrat ; Hannah Frisch ; Margaret Rose Hunt ; Kanishka Jain ; Genevieve Kocienda ;

Hannah Krauss ; Chenxi Liu ; Mary McKinley ; Danish Mir ; Connor Mullane ; Emery Patterson ; Debashish

Pradhan ; James Santos ; Britton Schams ; H

AN INTELLIGENT CLASS: THE DEVELOPMENT OF A NOVEL CONTEXT CAPTURING

FRAMEWORK SUPPORTING THE FUNCTIONAL AUTO-CLASSIFICATION OF RECORDS ..................... 3136 Nathaniel Payne

COMPUTATIONAL THINKING IN ARCHIVAL SCIENCE RESEARCH AND EDUCATION ......................... 3146 William Underwood ; Richard Marciano

WHAT COMPUTATIONAL ARCHIVAL SCIENCE CAN LEARN FROM ART HISTORY AND

MATERIAL CULTURE STUDIES ............................................................................................................................... 3153 Lyneise Williams

AUTOMATIC EXTRACTION OF PERSONALITY FROM TEXT: CHALLENGES AND

OPPORTUNITIES .......................................................................................................................................................... 3156 Nazar Akrami ; Johan Fernquist ; Tim Isbister ; Lisa Kaati ; Björn Pelzer

EXAMINATION OF DOUBLE ARBITER PUFS ON SECURITY AGAINST MACHINE

LEARNING ATTACKS .................................................................................................................................................. 3165 Meznah A. Alamro ; Yu Zhuang ; Ahmad O. Aseeri ; Mohammed Saeed Alkatheiri

MALICIOUS URL LINKAGE ANALYSIS AND COMMON PATTERN DISCOVERY ....................................... 3172 Shin-Ying Huang ; Tzu-Hsien Chuang ; Shi-Meng Huang ; Tao Ban

LEARNING TO GENERATE DIVERSE AND AUTHENTIC REVIEWS VIA AN ENCODER-

DECODER MODEL WITH TRANSFORMER AND GRU ........................................................................................ 3180 Kaifu Jin ; Xi Zhang ; Jiayuan Zhang

BYAKKO: AUTOMATIC WHITELIST GENERATION BASED ON OCCURRENCE

DISTRIBUTION OF FEATURES OF NETWORK TRAFFIC .................................................................................. 3190 Nobuyuki Kanaya ; Yu Tsuda ; Yuuki Takano ; Daisuke Inoue

A FRAMEWORK FOR CYBER THREAT INTELLIGENCE EXTRACTION FROM RAW LOG

DATA ................................................................................................................................................................................ 3200 Max Landauer ; Florian Skopik ; Markus Wurzenberger ; Wolfgang Hotwagner ; Andreas Rauber

DETECTING ADVERSARY USING WINDOWS DIGITAL ARTIFACTS ............................................................. 3210 Seng Pei Liew ; Satoshi Ikeda

DETECTION OF PHISHING WEBSITES USING GENERATIVE ADVERSARIAL NETWORK ...................... 3216 Pierrick Robic-Butez ; Thu Yein Win

DO’S AND DON’TS OF DISTRIBUTED INTRUSION DETECTION FOR INDUSTRIAL

NETWORK TOPOLOGIES ........................................................................................................................................... 3222 Peter Schneider

WORTH THE WAIT? TIME WINDOW FEATURE OPTIMIZATION FOR ATTACK

CLASSIFICATION ......................................................................................................................................................... 3232 Casey Wilson ; Xenia Mountrouidou ; Anna Little

FRAMEWORK FOR HEALTHCARE SECURITY PRACTICE ANALYSIS, MODELING AND

INCENTIVIZATION ...................................................................................................................................................... 3242 Prosper Kandabongee Yeng ; Bian Yang ; Einar Arthur Snekkenes

CAN MACHINE/DEEP LEARNING CLASSIFIERS DETECT ZERO-DAY MALWARE WITH

HIGH ACCURACY? ...................................................................................................................................................... 3252 Faranak Abri ; Sima Siami-Namini ; Mahdi Adl Khanghah ; Fahimeh Mirza Soltani ; Akbar Siami Namin

PROVENANCE–AWARE WORKFLOW FOR DATA QUALITY MANAGEMENT AND

IMPROVEMENT FOR LARGE CONTINUOUS SCIENTIFIC DATA STREAMS ................................................ 3260 Jitendra Kumar ; Michael C. Crow ; Ranjeet Devarakonda ; Michael Giansiracusa ; Kavya Guntupally ; Joseph V.

Olatt ; Zach Price ; Harold A. Shanafield ; Alka Singh

MTSAD: MULTIVARIATE TIME SERIES ABNORMALITY DETECTION AND

VISUALIZATION ........................................................................................................................................................... 3267 Vung Pham ; Ngan Nguyen ; Jie Li ; Jon Hass ; Yong Chen ; Tommy Dang

COMPARISON OF SUPPORT VECTOR MACHINE AND GRADIENT BOOSTING

REGRESSION TREE FOR PREDICTING SPATIALLY EXPLICIT LIFE CYCLE GLOBAL

WARMING AND EUTROPHICATION IMPACTS: A CASE STUDY IN CORN PRODUCTION ....................... 3277 Xiaobo Xue Romeiko ; Zhijian Guo ; Yulei Pang

THE PERFORMANCE OF LSTM AND BILSTM IN FORECASTING TIME SERIES ........................................ 3285 Sima Siami-Namini ; Neda Tavakoli ; Akbar Siami Namin

HACKERNETS: VISUALIZING MEDIA CONVERSATIONS ON INTERNET OF THINGS, BIG

DATA, AND CYBERSECURITY .................................................................................................................................. 3293 Hao Van ; Huyen N. Nguyen ; Rattikorn Hewett ; Tommy Dang

GPU-SFFT: A GPU BASED PARALLEL ALGORITHM FOR COMPUTING THE SPARSE FAST

FOURIER TRANSFORM (SFFT) OF K-SPARSE SIGNALS .................................................................................... 3303 Oswaldo Artiles ; Fahad Saeed

PARALLEL HYBRID METAHEURISTICS WITH DISTRIBUTED INTENSIFICATION AND

DIVERSIFICATION FOR LARGE-SCALE OPTIMIZATION IN BIG DATA STATISTICAL

ANALYSIS ....................................................................................................................................................................... 3312 Wendy K. Tam Cho ; Yan Y. Liu

PLANT EVENT DETECTION FROM TIME-VARYING POINT CLOUDS ........................................................... 3321 Tian Gao ; Jianxin Sun ; Feiyu Zhu ; Henry Akrofi Doku ; Yu Pan ; Harkamal Walia ; Hongfeng Yu

AUTO-CNNP: A COMPONENT-BASED FRAMEWORK FOR AUTOMATING CNN

PARALLELISM .............................................................................................................................................................. 3330 Soulaimane Guedria ; Noël De Palma ; Félix Renard ; Nicolas Vuillerme

AN EVALUATION OF RDMA-BASED MESSAGE PASSING PROTOCOLS ....................................................... 3340 Haoyu Huang ; Shahram Ghandeharizadeh

PERFORMANCE COMPARISON OF JULIA DISTRIBUTED IMPLEMENTATIONS OF

DIRICHLET PROCESS MIXTURE MODELS ........................................................................................................... 3350 Ruizhu Huang ; Weijia Xu ; Yinzhi Wang ; Silvia Liverani ; Ann E. Stapleton

ACCELERATING RNN ON FPGA WITH EFFICIENT CONVERSION OF HIGH-LEVEL

DESIGNS TO RTL .......................................................................................................................................................... 3355 Zongze Li ; Song Fu

VIEW SELECTION IN KNOT DEFORMATION ....................................................................................................... 3365 Juan Lin ; Hui Zhang

CONSTRUCTING SUFFIX ARRAY OF NEXT-GENERATION SEQUENCING UPON IN-

MEMORY LOOKUP CLOUD AND MAPREDUCE ................................................................................................... 3373 Li-Chen Liu ; Meng-Huang Lee ; Shin-Hung Chang ; Ray-I Chang ; Yu-Jung Chang ; Jan-Ming Ho

AN “ON THE FLY” FRAMEWORK FOR EFFICIENTLY GENERATING SYNTHETIC BIG

DATA SETS ..................................................................................................................................................................... 3379 Karl Mason ; Sadegh Vejdan ; Santiago Grijalva

TRANSPARENT IN-MEMORY CACHE MANAGEMENT IN APACHE SPARK BASED ON

POST-MORTEM ANALYSIS ........................................................................................................................................ 3388 Atsuya Nasu ; Kenji Yoneo ; Masao Okita ; Fumihiko Ino

A FAST EXACT VIEWSHED ALGORITHM ON GPU ............................................................................................. 3397 Faisal F. Qarah ; Yi-Cheng Tu

PARALLELIZED TOPOLOGICAL RELAXATION ALGORITHM ....................................................................... 3406 Guangchen Ruan ; Hui Zhang

PARALLEL R COMPUTING ON THE WEB ............................................................................................................. 3416 Ranjini Subramanian ; Hui Zhang

SPATIAL-TEMPORAL SCIENTIFIC DATA CLUSTERING VIA DEEP CONVOLUTIONAL

NEURAL NETWORK .................................................................................................................................................... 3424 Jianxin Sun ; Chunxia Wu ; Yufeng Ge ; Yusong Li ; Hongfeng Yu

PARALLEL TRAINING VIA COMPUTATION GRAPH TRANSFORMATION .................................................. 3430 Fei Wang ; Guoyang Chen ; Weifeng Zhang ; Tiark Rompf

A METHODOLOGY FOR CROSS-PLATFORM, EVENT-DRIVEN BIG DATA ANALYTICS-AS-

A-SERVICE ..................................................................................................................................................................... 3440 Claudio A. Ardagna ; Valerio Bellandi ; Paolo Ceravolo ; Ernesto Damiani ; Rino Finazzo

CONTEXT-AUGMENTED SOFTWARE DEVELOPMENT IN TRADITIONAL AND BIG DATA

PROJECTS: LITERATURE REVIEW AND PRELIMINARY FRAMEWORK ..................................................... 3449 Glaucia Melo ; Paulo Alencar ; Don Cowan

A FRAMEWORK FOR IDENTIFYING AND PRIORITIZING DATA ANALYTICS

OPPORTUNITIES IN ADDITIVE MANUFACTURING ........................................................................................... 3458 Hyunseop Park ; Hyunwoong Ko ; Yung-Tsun T. Lee ; Hyunbo Cho ; Paul Witherell

SKI: AN AGILE FRAMEWORK FOR DATA SCIENCE .......................................................................................... 3468 Jeffrey Saltz ; Alex Suthrland

ACHIEVING AGILE BIG DATA SCIENCE: THE EVOLUTION OF A TEAM’S AGILE PROCESS

METHODOLOGY .......................................................................................................................................................... 3477 Jeffrey S. Saltz ; Ivan Shamshurin

A HYBRID APPROACH TO DYNAMIC ENTERPRISE DATA PLATFORM ....................................................... 3486 Mehmet Selman Sezgin ; Ahmet Tugrul Bayrak ; Olcay Taner Ytldtz

REAL-TIME MACHINE LEARNING COMPETITION ON DATA STREAMS AT THE IEEE BIG

DATA 2019 ....................................................................................................................................................................... 3493 Dihia Boulegane ; Nedeljko Radulovic ; Albert Bifet ; Ghislain Fievet ; Jimin Sohn ; Yeonwoo Nam ; Seojeong Yu ;

Dong-Wan Choi

COLLABORATIVE STREAMING: TRUST REQUIREMENTS FOR PRICE SHARING .................................... 3498 Tobias Grubenmann ; Daniele Dell’Aglio ; Abraham Bernstein

MLK SMART CORRIDOR: AN URBAN TESTBED FOR SMART CITY APPLICATIONS ............................... 3506 Austin Harris ; Jose Stovall ; Mina Sartipi

SCALABLE AND RELIABLE MULTI-DIMENSIONAL AGGREGATION OF SENSOR DATA

STREAMS ........................................................................................................................................................................ 3512 Sören Henning ; Wilhelm Hasselbring

TV AD EVENTS AND DIGITAL SEARCH: ON THE SELECTION OF OUTCOME MEASURES .................... 3518 Shawndra Hill ; Anthony Colas ; H. Andrew Schwartz ; Gordon Burtch

IMAGE MINING FOR REAL TIME QUALITY ASSURANCE IN RAPID PROTOTYPING .............................. 3529 Sebastian Trinks ; Carsten Felden

A DEEP LEARNING APPROACH TO TRESPASSING DETECTION USING VIDEO

SURVEILLANCE DATA ............................................................................................................................................... 3535 Muzammil Bashir ; Elke A. Rundensteiner ; Ramoza Ahsan

CLASSIFYING IN-VEHICLE NOISE FROM MULTI-CHANNEL SOUND SPECTRUM BY DEEP

BEAMFORMING NETWORKS ................................................................................................................................... 3545 Seok-Jun Bu ; Sung-Bae Cho

AGENT-BASED MODELING TO SIMULATE ROAD TRAVEL USING BIG DATA FROM

SMARTPHONE GPS: AN APPLICATION TO THE CONTINENTAL UNITED STATES ................................... 3553 Sashikanth Gurram ; Vijayaraghavan Sivaraman ; Jonathan T. Apple ; Abdul R. Pinjari

TRAVEL PATTERN EXTRACTION FROM SMART CARD DATA USING DATA POLISHING ..................... 3563 Mio Hosoe ; Masashi Kuwano ; Taku Moriyama ; Kosuke Miyazaki ; Masaki Ito

USING TIMED SEQUENTIAL PATTERNS IN THE TRANSPORTATION INDUSTRY ..................................... 3573 Somayah Karsoum ; Le Gruenwald ; Clark Barrus ; Eleazar Leal

PERSONALIZED POI EMBEDDING FOR SUCCESSIVE POI RECOMMENDATION WITH

LARGE-SCALE SMART CARD DATA ....................................................................................................................... 3583 Jin-Young Kim ; Kyung-Hyun Lim ; Sung-Bae Cho

MODELING DYNAMIC SPATIAL-TEMPORAL CLUSTER RELATIONSHIPS ................................................. 3590 Ivens Portugal ; Paulo Alencar ; Donald Cowan

BLOCKCHAIN TECHNOLOGY AS A MECHANISM FOR DIGITAL RAILWAY TICKETING ...................... 3599 J. D. Preece ; J. M. Easton

PERFORMANCE BENCHMARKING OF DATA AUGMENTATION AND DEEP LEARNING FOR

TORNADO PREDICTION ............................................................................................................................................. 3607 Carlos A. Barajas ; Matthias K. Gobbert ; Jianwu Wang

MEASURING, QUANTIFYING, AND PREDICTING THE COST-ACCURACY TRADEOFF ........................... 3616 Matt Baughman ; Nifesh Chakubaji ; Hong-Linh Truong ; Krists Kreics ; Kyle Chard ; Ian Foster

BENCHMARKING DISCRETISATION LEVEL OF CONTINUOUS ATTRIBUTES:

THEORETICAL AND EXPERIMENTAL APPROACHES ....................................................................................... 3623 Wanghu Chen ; Chao Wang ; Jing Li ; Bo Yang ; Yang Liu ; Jianwu Wang

FAST STOCHASTIC BLOCK PARTITIONING USING A SINGLE COMMODITY MACHINE ....................... 3632 Md Abdul Motaleb Faysal ; Shaikh Arifuzzaman

SEARCH FOR K: ASSESSING FIVE TOPIC-MODELING APPROACHES TO 120,000

CANADIAN ARTICLES ................................................................................................................................................ 3640 Qiang Fu ; Yufan Zhuang ; Jiaxin Gu ; Yushu Zhu ; Huihui Qin ; Xin Guo

ANALYSIS AND PREDICTION OF DATA TRANSFER THROUGHPUT FOR DATA-INTENSIVE

WORKLOADS ................................................................................................................................................................ 3648 Devarshi Ghoshal ; Kesheng Wu ; Eric Pouyoul ; Erich Strohmaier

REACTIVE MICROSERVICES IN COMMODITY RESOURCES .......................................................................... 3658 Divya Goel ; Amaresh Nayak

GEOYCSB: A BENCHMARK FRAMEWORK FOR THE PERFORMANCE AND SCALABILITY

EVALUATION OF NOSQL DATABASES FOR GEOSPATIAL WORKLOADS ................................................... 3666 Suneuy Kim ; Yuvraj Singh Kanwar

AN EMPIRICAL STUDY OF QUAD-LEVEL CELL (QLC) NAND FLASH SSDS FOR BIG DATA

APPLICATIONS ............................................................................................................................................................. 3676 Shuwen Liang ; Zhi Qiao ; Sihai Tang ; Jacob Hochstetler ; Song Fu ; Weisong Shi ; Hsing-Bung Chen

AN EMPIRICAL STUDY OF RABIN FINGERPRINTING PARAMETERS .......................................................... 3686 Paul Lu ; Owen Randall ; Emma McDonald

GHOSTZ PW/GF: DISTRIBUTED PARALLEL HOMOLOGY SEARCH SYSTEM FOR LARGE-

SCALE METAGENOMIC ANALYSIS ........................................................................................................................ 3692 Kenta Machida ; Osamu Tatebe

AN EXPERIMENTAL COMPARISON OF GPU TECHNIQUES FOR DBSCAN CLUSTERING ....................... 3701 Hamza Mustafa ; Eleazar Leal ; Le Gruenwald

TOWARDS A HIGH-LEVEL DESCRIPTION FOR GENERATING STREAM PROCESSING

BENCHMARK APPLICATIONS .................................................................................................................................. 3711 Alessio Pagliari ; Fabrice Huet ; Guillaume Urvoy-Keller

MIX AND RANK: A FRAMEWORK FOR BENCHMARKING RECOMMENDER SYSTEMS .......................... 3717 Bibek Paudel ; Dragi Kocev ; Tome Eftimov

MULTIDIMENSIONAL PREFERENCE QUERY OPTIMIZATION ON INFRASTRUCTURE

MONITORING SYSTEMS ............................................................................................................................................ 3727 Yinghua Qin ; Gheorghi Guzun

A GRAY-BOX TESTING METHOD FOR DIVIDE&CONQUER IN IMAGE PROCESSING ............................. 3737 Marco Strutz ; Hermann Heßling ; Peter Hufnagl

GRAPHOPT: A FRAMEWORK FOR AUTOMATIC PARAMETERS TUNING OF GRAPH

PROCESSING FRAMEWORKS ................................................................................................................................... 3744 Muaz Twaty ; Amine Ghrab ; Sabri Skhiri

CLUSTER-SIZE OPTIMIZATION WITHIN A CLOUD-BASED ETL FRAMEWORK FOR BIG

DATA ................................................................................................................................................................................ 3754 Eftim Zdravevski ; Petre Lameski ; Ace Dimitrievski ; Marek Grzegorowski ; Cas Apanowicz

EFFICIENT FEATURE EMBEDDING OF 3D BRAIN MRI IMAGES FOR CONTENT-BASED

IMAGE RETRIEVAL WITH DEEP METRIC LEARNING ...................................................................................... 3764 Yuto Onga ; Shingo Fujiyama ; Hayato Arai ; Yusuke Chayama ; Hitoshi Iyatomi ; Kenichi Oishi

USING GOVERNANCE AND ADAPTIVE NORMATIVE MULTIAGENT SYSTEMS FOR

DYNAMIC VEHICLE PLATOON FORMATION ...................................................................................................... 3770 Lauro Caetano ; Marx Viana ; Elder Cirilo ; Paulo Alencar ; Carlos Lucena

TOWARDS BUILDING AN INTERACTIVE PLATFORM FOR ANALYZING MOVEMENT OF

BUSES IN HAJJ .............................................................................................................................................................. 3775 Emad Felemban ; Faizan Ur Rehman ; Asad Ali Biabani ; Atif Naseer ; Usama Alabdulwahab

PIECEWISE STATIONARY MODELING OF RANDOM PROCESSES OVER GRAPHS WITH AN

APPLICATION TO TRAFFIC PREDICTION ............................................................................................................ 3779 Arman Hasanzadeh ; Xi Liu ; Nick Duffield ; Krishna R. Narayanan

CLASSIFICATION OF DRIVING BEHAVIOR EVENTS UTILIZING KINEMATIC

CLASSIFICATION AND MACHINE LEARNING FOR DOWN SAMPLED TIME SERIES DATA ................... 3789 Vikram Krishnamurthy ; Kusha Nezafati ; Juhyun Bae ; Emre Gursoy ; Mian Zhong ; Vikrant Singh

802.11AC AND P IN A SIMULATED VANET ENVIRONMENT ............................................................................. 3797 Michael Lee ; Beichen Yang ; Travis Atkison

KNOWLEDGE AND SITUATION-AWARE VEHICLE TRAFFIC FORECASTING ........................................... 3803 Hao Peng ; Nicholas Klepp ; Mohammadhossein Toutiaee ; I. Budak Arpinar ; John A. Miller

SCALABLE OBJECT TRACKING IN SMART CITIES ........................................................................................... 3813 Jose Stovall ; Austin Harris ; Amanda O’Grady ; Mina Sartipi

PRELIMINARY RESEARCH ON VEHICLE SPEED DETECTION USING TRAFFIC CAMERAS .................. 3820 Chen Wang ; Aibek Musaev

DETECTING PEDESTRIAN CROSSING EVENTS IN LARGE VIDEO DATA FROM TRAFFIC

MONITORING CAMERAS ........................................................................................................................................... 3824 Weijia Xu ; Natalia Ruiz ; Kelly Pierce ; Ruizhu Huang ; Joel Meyer ; Jen Duthie

DISCOVERING HIGH DEMANDING BUS ROUTES USING FARECARD DATA .............................................. 3832 Hongzhi Yang ; Chieh-Feng Chiang ; Chen Arbee L. P.

TRAINING-FREE MONOCULAR 3D EVENT DETECTION SYSTEM FOR TRAFFIC

SURVEILLANCE ............................................................................................................................................................ 3838 Lijun Yu ; Peng Chen ; Wenhe Liu ; Guoliang Kang ; Alexander G. Hauptmann

EXPLORING THE BRAND COMPETITION PATTERNS OF CHINA’S AUTO MARKET WITH

MASSIVE SALES LEADS ............................................................................................................................................. 3844 Sheng Zhang ; Haoyang Che ; Jiacai Zhang ; Yucong Duan

AN EVALUATION OF RDMA-BASED MESSAGE PASSING PROTOCOLS ....................................................... 3854 Haoyu Huang ; Shahram Ghandeharizadeh

A TECHNICAL PERSPECTIVE OF DATACALC — AD-HOC ANALYSES ON

HETEROGENEOUS DATA SOURCES ....................................................................................................................... 3864 Johannes Luong ; Dirk Habich ; Wolfgang Lehner

KAFKA: THE DATABASE INVERTED, BUT NOT GARBLED OR COMPROMISED ....................................... 3874 Sean Rooney ; Peter Urbanetz ; Chris Giblin ; Daniel Bauer ; Florian Froese ; Luis Garcés-Erice ; Saša Tomic

ON THE RESTFUL WEB SERVICES FOR MANAGING APPLICATION VIRTUALIZATION

ENVIRONMENTS .......................................................................................................................................................... 3881 Elif Cansu Yildiz ; Engin Unal ; Hakan Tuzun ; Doruk Eren Aktas ; Mehmet S. Aktas

A MOBILITY-DRIVEN APPROACH TO MODELING BUILDING ENERGY ..................................................... 3887 Anne Berres ; Piljae Im ; Kuldeep Kurte ; Melissa Allen-Dumas ; Gautam Thakur ; Jibonananda Sanyal

THE BLESSING OF DIMENSIONALITY IN MANY-OBJECTIVE SEARCH: AN INVERSE

MACHINE LEARNING INSIGHT ............................................................................................................................... 3896 Abhishek Gupta ; Yew-Soon Ong ; Mojtaba Shakeri ; Xu Chi ; Allan Zhang Nengsheng

FAST TOP-N PERSONALIZED RECOMMENDATION ON ITEM GRAPH ......................................................... 3903 Zhuoyi Lin ; Lei Feng ; Chee-Keong Kwoh ; Chi Xu

ANALYSIS OF HURRICANE MATTHEW 2016 DATA TO ESTIMATE AIRLINE PASSENGERS

DISRUPTION .................................................................................................................................................................. 3909 Harshitha Meda ; Lauren B. Davis ; Chrysafis Vogiatzis

ESTIMATION OF TRANSACTIONAL NETWORK DATA BETWEEN BRANCH OFFICES

USING TRANSACTIONAL BIG DATA THROUGHOUT JAPAN .......................................................................... 3916 Yoshiki Ogawa ; Yuki Akiyama ; Sekimoto Yoshihide ; Ryosuke Shibasaki

COPING WITH BIG DATA IN TRANSFER OPTIMIZATION................................................................................ 3925 Mojtaba Shakeri ; Abhishek Gupta ; Yew-Soon Ong ; Xu Chi ; Allan Zhang Nengsheng

SERVICE TIME PREDICTION FOR LAST-YARD DELIVERY ............................................................................. 3933 Junxian Song ; Rong Wen ; Chi Xu ; Joel Wei En Tay

PERFORMANCE EVALUATION OF ETHEREUM-BASED ON-CHAIN SENSOR DATA

MANAGEMENT PLATFORM FOR INDUSTRIAL IOT .......................................................................................... 3939 Kentaroh Toyoda ; Mojtaba Shakeri ; Xu Chi ; Allan N. Zhang

ENSEMBLE KRIGING FOR ENVIRONMENTAL SPATIAL PROCESSES .......................................................... 3947 Gokhan Mert Yagli ; Joel Wei En Tay ; Dazhi Yang

DYNAMIC VISUALIZATION OF LARGE SCALE TACTICAL NETWORK SIMULATIONS .......................... 3951 Venkat Dasari ; Stephen Allen ; Scott E. Brown

ANALYSIS OF HIGH FIDELITY NS-3 SIMULATIONS TO STUDY REAL-TIME APPLICATION

PERFORMANCE IN TACTICAL WIRELESS NETWORKS ................................................................................... 3955 Venkat Dasari ; David Alexander ; Scott Brown ; Jenmfer Brooks ; Brian Panneton ; Simon Su

LEVERAGING COMPREHENSIVE DATA ANALYSIS TO INFORM PARALLEL HPC

WORKLOADS ................................................................................................................................................................ 3960 Matthew Dwyer ; Nicole Kaff ; Jacob Cohen ; Michael Frauenhoffer

DYNAMIC COLLABORATIVE VISUALIZATION ECOSYSTEM TO SUPPORT THE ANALYSIS

OF LARGE-SCALE DISPARATE DATA .................................................................................................................... 3968 Christopher Koehler ; Andrew Berger ; Raksha Rajashekar ; Thomas Wischgoll ; Simon Su

DETECTING NETWORK SOFT-FAILURES WITH THE NETWORK LINK OUTLIER FACTOR

(NLOF) ............................................................................................................................................................................. 3978 Christopher Mendoza ; Venkat Dasari ; Michael P. McGarry

VISUALIZATION TECHNIQUES FOR LARGE-SCALE MONTE CARLO SIMULATION ............................... 3984 Vincent Perry ; Wendy Gao ; Michael Chen ; J. Michael Barton ; Simon Su

HYBRID 2D AND 3D VISUAL ANALYTICS OF NETWORK SIMULATION DATA .......................................... 3992 Simon Su ; Vincent Perry ; Venkat Dasari

ON THE QUALITY OF CLASSIFICATION MODELS FOR INFERRING ABAC POLICIES

FROM ACCESS LOGS .................................................................................................................................................. 4000 Luca Cappelletti ; Stefano Valtolina ; Giorgio Valentini ; Marco Mesiti ; Elisa Bertino

TOWARDS A NEURAL-SYMBOLIC GENERATIVE POLICY MODEL............................................................... 4008 Daniel Cunnington ; Mark Law ; Alessandra Russo ; Elisa Bertino ; Seraphin Calo

ACCESS CONTROL MODEL EXTENSIONS TO SUPPORT DATA PRIVACY PROTECTION

BASED ON GDPR ........................................................................................................................................................... 4017 Maryam Davari ; Elisa Bertino

A POLICY-BASED APPROACH FOR MEASURING DATA QUALITY................................................................ 4025 K. Grueneberg ; S. Calo ; P. Dewan ; D. Verma ; Tristan O’Gorman

COUNTING DEVICES: REVISITING EXISTING APPROACHES IN TODAY’S SETTINGS ........................... 4032 Franck Le ; Enriquillo Valdez ; Pau-Chen Cheng

POLICY BASED ENSEMBLES FOR APPLYING ML ON BIG DATA ................................................................... 4038 D. Verma ; S. Calo ; E. Bertino ; A. Russo ; G. White

ENABLING PRIVACY POLICIES FOR MHEALTH STUDIES .............................................................................. 4045 Brian Wang ; Mani B. Srivastava

MINING TOP-UP TRANSACTIONS AND ONLINE CLASSIFIED ADS TO PREDICT URBAN

NEIGHBORHOODS SOCIOECONOMIC STATUS .................................................................................................. 4055 Eduardo Cruz ; Carmen Vaca ; Allan Avendaño

TOWARDS BUILDING EVACUATION PLANNING PLATFORM USING MULTIMODAL

TRANSPORTATION FOR A LARGE CROWD ......................................................................................................... 4063 Emad Felemban ; Faizan Ur Rehman ; Hassan Wadood ; Atif Naseer

BENCH4GIS: BENCHMARKING PRIVACY-AWARE GEOCODING WITH OPEN BIG DATA ...................... 4067 Daniel R. Harris ; Chris Delcher

SHORT PAPER: EVALUATION OF LOCATION ESTIMATION METHOD THAT FOCUSES ON

GEOGRAPHICAL PROXIMITY OF FRIENDS ......................................................................................................... 4071 Keisuke Ikeda ; Kazufumi Kojima ; Masahiro Tani

DECISION-MAKING SYSTEM FOR ROAD-RECOVERY CONSIDERING HUMAN MOBILITY

BY APPLYING DEEP Q-NETWORK .......................................................................................................................... 4075 Soo-Hyun Joo ; Yoshiki Ogawa ; Yoshihide Sekimoto

SHORT PAPER: USER IDENTIFICATION ACROSS ONLINE SOCIAL NETWORKS BASED ON

SIMILARITIES AMONG DISTRIBUTIONS OF FRIENDS’ LOCATIONS ........................................................... 4085 Kazufumi Kojima ; Keisuke Ikeda ; Masahiro Tani

MONITORING OF NATURAL DISASTERS THROUGH ANOMALY DETECTION ON MOBILE

PHONE DATA ................................................................................................................................................................. 4089 Aude Marzuoli ; Fengmei Liu

CO-LOCATION PATTERN MINING OF GEOSOCIAL DATA TO CHARACTERIZE URBAN

FUNCTIONAL SPACES ................................................................................................................................................ 4099 Arif Masrur ; Gautam Thakur ; Kevin Sparks ; Rachel Palumbo ; Donna J. Peuquet

BIGDATACUBE: A SCALABLE, FEDERATED SERVICE PLATFORM FOR COPERNICUS ......................... 4103 Dimitar Mišev ; Peter Baumann ; Dimitris Bellos ; Stefan Wiehle

ACCURATE SPATIAL MAPPING OF SOCIAL MEDIA DATA WITH PHYSICAL LOCATIONS ................... 4113 Mohit Mittal ; Panote Siriaraya ; Chonho Lee ; Yukiko Kawai ; Takashi Yoshikawa ; Shinji Shimojo

SPATIO-TEMPORAL CLASSIFICATION AT MULTIPLE RESOLUTIONS USING MULTI-VIEW

REGULARIZATION ...................................................................................................................................................... 4117 Guruprasad Nayak ; Rahul Ghosh ; Xiaowei Jia ; Varun Mithal ; Vipin Kumar

A DEMONSTRATION OF B-EAGLEV VISUALIZING MASSIVE POINT CLOUD DIRECTLY

FROM HDFS ................................................................................................................................................................... 4121 Minh Hieu Nguyen ; Sanghyun Yoon ; Sangyoon Park ; Joon Heo

DEMO PAPER: LARGE SCALE SPATIAL DATA PROCESSING WITH USER DEFINED

FILTERS IN BBOXDB ................................................................................................................................................... 4125 Jan Kristof Nidzwetzki ; Ralf Hartmut Güting

EULER++: IMPROVED SELECTIVITY ESTIMATION FOR RECTANGULAR SPATIAL

RECORDS ........................................................................................................................................................................ 4129 A. B. Siddique ; Ahmed Eldawy ; Vagelis Hristidis

CPSITRES: A COLLABORATIVE SYSTEM FOR ANALYSIS OF BIG DATA ON SEA ICE ............................. 4134 Vinit Veerendraveer Singh ; Scott Sorensen ; Chandra Kambhamettu

SEXTANT: A COMPUTATIONAL FRAMEWORK FOR SCALABLE AND EFFICIENT

CORRELATION OF SPATIO-TEMPORAL TRAJECTORIES ............................................................................... 4138 Brian Thompson ; Dave Cedel ; Jeremy Martin ; Kristen Snee ; Alex Cheung

PTOLEMAIC INDEXING FOR MANAGING AND QUERYING INTERNET OF THINGS (IOT)

DATA ................................................................................................................................................................................ 4148 Christian Beecks ; Fabian Berns ; Kjeld Willy Schmidt

A NEW APPROACH FOR EFFICIENT STRUCTURE DISCOVERY IN IOT ....................................................... 4152 Fabian Berns ; Kjeld Schmidt ; Alexander Grass ; Christian Beecks

EFFECTIVELY TESTING SYSTEM CONFIGURATIONS OF CRITICAL IOT ANALYTICS

PIPELINES ...................................................................................................................................................................... 4157 Morgan K. Geldenhuys ; Lauritz Thamsen ; Kain Kordian Gontarskay ; Felix Lorenz ; Odej Kao

DETECTING ERRORS IN SHORT-TERM ELECTRICITY DEMAND FORECAST USING

PEOPLE DYNAMICS .................................................................................................................................................... 4163 Guillaume Habault ; Yasutaka Nishimura ; Kiyohito Yoshihara ; Chihiro Ono

KENSOR: COORDINATED INTELLIGENCE FROM CO-LOCATED SENSORS ............................................... 4169 Olivera Kotevska ; Kalyan Perumalla ; Juan Lopez

TOWARDS AN INTERACTIVE VISUALIZATION FRAMEWORK FOR IOT DEVICE DATA

FLOW ............................................................................................................................................................................... 4175 Ebelechukwu Nwafor ; Habeeb Olufowobi

IOT BASED URBAN NOISE MONITORING IN DEEP LEARNING USING HISTORICAL

REPORTS ........................................................................................................................................................................ 4179 Sayed Khushal Shah ; Zeenat Tariq ; Yugyung Lee

SMART HOME IOT ANOMALY DETECTION BASED ON ENSEMBLE MODEL LEARNING

FROM HETEROGENEOUS DATA .............................................................................................................................. 4185 Sihai Tang ; Zhaochen Gu ; Qing Yang ; Song Fu

SPEECH EMOTION DETECTION USING IOT BASED DEEP LEARNING FOR HEALTH CARE ................. 4191 Zeenat Tariq ; Sayed Khushal Shah ; Yugyung Lee

ASSOCIATION MODEL BETWEEN VISUAL FEATURE AND AQI RANK USING LIFELOG

DATA ................................................................................................................................................................................ 4197 Phuong-Binh Vo ; Trong-Dat Phan ; Minh-Son Dao ; Koji Zettsu

AUTOMATIC HYPERPARAMETER TUNING METHOD FOR LOCAL OUTLIER FACTOR,

WITH APPLICATIONS TO ANOMALY DETECTION ............................................................................................ 4201 Zekun Xu ; Deovrat Kakde ; Arin Chaudhuri

COST-ADAPTIVE NEURAL NETWORKS FOR PEAK VOLUME PREDICTION WITH EMM

FILTERING ..................................................................................................................................................................... 4208 Bin Yu ; Giovanna Graciani ; Anderson Nascimento ; Juhua Hu

SCALABILITY ANALYSIS OF BLOCKCHAIN ON A SERVERLESS CLOUD ................................................... 4214 Alex Kaplunovich ; Karuna P. Joshi ; Yelena Yesha

PREDICTING TRANSACTION LATENCY WITH DEEP LEARNING IN PROOF-OF-WORK

BLOCKCHAINS ............................................................................................................................................................. 4223 Enrico Tedeschi ; Tor-Arne S. Nordmo ; Dag Johansen ; Håvard D. Johansen

AONT-NZZD: A SECURE AND EFFICIENT DISPERSAL SCHEME IN DISTRIBUTED

STORAGE SYSTEMS .................................................................................................................................................... 4232 Pengcheng Xie ; Hui Li ; Haiyang Yu ; Zequan Chen

SECURITY VALIDATION TESTING ENVIRONMENT IN THE CLOUD ............................................................ 4240 Cristian Bucur ; Eduard Babulak

THE IMPACT OF PREPROCESSING IN NATURAL LANGUAGE FOR OPEN SOURCE

INTELLIGENCE AND CRIMINAL INVESTIGATION ............................................................................................ 4248 Jan William Johnsen ; Katrin Franke

BASIC FORENSIC PROCEDURES FOR CYBER CRIME INVESTIGATION IN SMART GRID

NETWORKS .................................................................................................................................................................... 4255 Igor Kotsiuba ; Inna Skarga-Bandurova ; Alkiviadis Giannakoulias ; Oksana Bulda

TECHNIQUE FOR FINDING AND INVESTIGATING THE STRONGEST COMBINATIONS OF

CYBERATTACKS ON SMART GRID INFRASTRUCTURE ................................................................................... 4265 Igor Kotsiuba ; Inna Skarga-Bandurova ; Alkiviadis Giannakoulias ; Mykhailo Chaikin ; Aleksandar Jevremovic

DEEP IN THE DARK: A NOVEL THREAT DETECTION SYSTEM USING DARKNET TRAFFIC ................. 4273 Sanjay Kumar ; Harald Vranken ; Joost Van Dijk ; Timo Hamalainen

PACE: PLATFORM FOR ANDROID MALWARE CLASSIFICATION AND PERFORMANCE

EVALUATION ................................................................................................................................................................ 4280 Ajit Kumar ; Vinti Agarwal ; Shishir K. Shandilya ; Andrii Shalaginov ; Saket Upadhyay ; Bhawna Yadav

DETECTION OF FRAUDULENT BEHAVIOR USING THE COMBINED ALGEBRAIC AND

MACHINE LEARNING APPROACH .......................................................................................................................... 4289 Oleksandr Letychevskyi ; Tetiana Polhul

COMPARATIVE STUDY OF WEAR-LEVELING IN SOLID-STATE DRIVE WITH NTFS FILE

SYSTEM ........................................................................................................................................................................... 4294 Ashar Neyaz ; Bing Zhou ; Narasimha Karpoor

DETECTING WEB SPAM IN WEBGRAPHS WITH PREDICTIVE MODEL ANALYSIS .................................. 4299 Naw Safrin Sattar ; Shaikh Arifuzzaman ; Minhaz F. Zibran ; Md Mohiuddin Sakib

CYBERCRIME INVESTIGATIONS IN THE ERA OF SMART APPLICATIONS: WAY

FORWARD THROUGH BIG DATA ............................................................................................................................ 4309 Andrii Shalaginov ; Igor Kotsiuba ; Asif Iqbal

FORECASTING OF TRENDS IN LEGAL SPEND MANAGEMENT ...................................................................... 4315 Pragati Awasthi ; Jerzy Bala ; Sebastian Carter

EXPERIMENTAL EVALUATION OF CNN PARAMETERS FOR TEXT CATEGORIZATION IN

LEGAL DOCUMENT REVIEW ................................................................................................................................... 4320 Qian Han ; Yufeng Kou ; Derek Snaidauf

IMAGE ANALYTICS FOR LEGAL DOCUMENT REVIEW : A TRANSFER LEARNING

APPROACH ..................................................................................................................................................................... 4325 Nathaniel Huber-Fliflet ; Fusheng Wei ; Haozhen Zhao ; Han Qin ; Shi Ye ; Amy Tsang

NORMALISATION OF SWIFT MESSAGE COUNTERPARTIES WITH FEATURE

EXTRACTION AND CLUSTERING ............................................................................................................................ 4329 Thanasis Schoinas ; Benjamin Guinard ; Diba Esbati ; Richard Chalk

QUALITATIVE MAPPING MODELING OF CRIMINALS’ SENSE OF SECURITY IN THEFT

CASES .............................................................................................................................................................................. 4337 Yajie Su ; Hao Lu ; Hongzhou Zhang

JAPANESE MISTAKABLE LEGAL TERM CORRECTION USING INFREQUENCY-AWARE

BERT CLASSIFIER ........................................................................................................................................................ 4342 Takahiro Yamakoshi ; Takahiro Komamizu ; Yasuhiro Ogawa ; Katsuhiko Toyama

SUPERVISED KEY TERMS CLUSTERING FOR REGULATORY MONITORING ............................................ 4352 Yong Zou ; Andrew Pat Waldo

FAILING & !FALLING (F&!F): LEARNING TO CLASSIFY ACCIDENTS AND INCIDENTS IN

AIRCRAFT DATA .......................................................................................................................................................... 4357 Jarrod Carson ; Kane Hollingsworth ; Rituparna Datta ; Aviv Segev

PRESCRIPTIVE EQUIPMENT MAINTENANCE: A FRAMEWORK ................................................................... 4366 Suresh Choubey ; Ryan Benton ; Tom Johnsten

MACHINE LEARNING USE CASES FOR SMART MANUFACTURING KPIS ................................................... 4375 Sandeep Jeereddy ; Ken Kennedy ; Eddie Duffy ; Annie Walker ; Bennie Vorster

SPATIOTEMPORAL REAL-TIME ANOMALY DETECTION FOR SUPERCOMPUTING

SYSTEMS ......................................................................................................................................................................... 4381 Qiao Kang ; Ankit Agrawal ; Alok Choudhary ; Alex Sim ; Kesheng Wu ; Rajkumar Kettimuthu ; Peter H. Beckman

; Zhengchun Liu ; Wei-Keng Liao

FORECASTING CROSS-BORDER POWER EXCHANGES THROUGH AN HVDC LINE USING

DYNAMIC MODELLING ............................................................................................................................................. 4390 Sylvie Koziel ; Patrik Hilber ; Per Westerlund ; Ebrahim Shayesteh

APPLICATION OF MACHINE LEARNING AND SPATIAL BOOTSTRAPPING TO IMAGE

PROCESSING FOR PREDICTIVE MAINTENANCE ............................................................................................... 4395 Vikram Krishnamurthy ; Kusha Nezafati ; Vikrant Singh

SELF-SUPERVISED MULTI-STAGE ESTIMATION OF REMAINING USEFUL LIFE FOR

ELECTRIC DRIVE UNITS ............................................................................................................................................ 4402 Ivan Melendez ; Rolando Doelling ; Oliver Bringmann

DATA IMPUTATION METHOD BASED ON PROGRAMMING BY EXAMPLE: APREP-S .............................. 4412 Hiroko Nagashima ; Yuka Kato

WIND TURBINE OPERATIONAL STATE PREDICTION: TOWARDS FEATURELESS, END-TO-

END PREDICTIVE MAINTENANCE .......................................................................................................................... 4422 Adrian Stetco ; Anees Mohammed ; Siniša Djurovic ; Goran Nenadic ; John Keane

SUBJECT-ORIENTED DATA RETRIEVAL AND ANALYSIS ON SINA WEIBO ................................................ 4431 Dan Chia-Tien Lo ; Charles Garnder ; Pascal Paschos ; Chung Ng

SEASONALITY-ADJUSTED CONCEPTUAL-RELEVANCY-AWARE RECOMMENDER

SYSTEM IN ONLINE GROCERIES ............................................................................................................................ 4435 Luyi Ma ; Jason H. D. Cho ; Sushant Kumar ; Kannan Achan

PURCHASE PREDICTION IN FREE ONLINE GAMES VIA SURVIVAL ANALYSIS ....................................... 4444 Wanshan Yang ; Ting Huang ; Junlin Zeng ; Yan Tang ; Lijun Chen ; Shivakant Mishra ; Youjian Eugene Liu

AN ENERGY-EFFICIENT RECONFIGURABLE LSTM ACCELERATOR FOR NATURAL

LANGUAGE PROCESSING.......................................................................................................................................... 4450 Elham Azari ; Sarma Vrudhula

EXPLORATION OF OPENCL 2D CONVOLUTION KERNELS ON INTEL FPGA, CPU, AND GPU

PLATFORMS .................................................................................................................................................................. 4460 Zheming Jin ; Hal Finkel

A CASE STUDY OF K-MEANS CLUSTERING USING SYCL ................................................................................ 4466 Zheming Jin ; Hal Finkel

BAYESIAN-BASED HYPERPARAMETER OPTIMIZATION FOR SPIKING NEUROMORPHIC

SYSTEMS ......................................................................................................................................................................... 4472 Maryam Parsa ; J. Parker Mitchell ; Catherine D. Schuman ; Robert M. Patton ; Thomas E. Potok ; Kaushik Roy

EVOLVING ENERGY EFFICIENT CONVOLUTIONAL NEURAL NETWORKS ............................................... 4479 Steven R. Young ; Pravallika Devineni ; Maryam Parsa ; J. Travis Johnston ; Bill Kay ; Robert M. Patton ;

Catherine D. Schuman ; Derek C. Rose ; Thomas E. Potok

QUANTUM GROVER SEARCH-BASED OPTIMIZATION FOR INNOVATIVE MATERIAL

DISCOVERY ................................................................................................................................................................... 4486 Sima Esfandiarpour Borujeni ; Ramkumar Harikrishnakumar ; Saideep Nannapaneni

LEARNING TO PREDICT MATERIAL STRUCTURE FROM NEUTRON SCATTERING DATA ................... 4490 Cristina Garcia-Cardona ; Ramakrishnan Kannan ; Travis Johnston ; Thomas Proffen ; Katharine Page ; Sudip K.

Seal

VISUALIZATION SYSTEM FOR EVOLUTIONARY NEURAL NETWORKS FOR DEEP

LEARNING ...................................................................................................................................................................... 4498 Junghoon Chae ; Catherine D. Schuman ; Steven R. Young ; J. Travis Johnston ; Derek C. Rose ; Robert M. Patton

; Thomas E. Potok

CLUSTERED LATENT DIRICHLET ALLOCATION FOR SCIENTIFIC DISCOVERY .................................... 4503 Christopher Gropp ; Alexander Herzog ; Ilya Safro ; Paul W. Wilson ; Amy W. Apon

REALISTIC TRANSPORT SIMULATION: TACKLING THE SMALL DATA CHALLENGE

WITH OPEN DATA ........................................................................................................................................................ 4512 Guimu Guo ; Jalal Majed Khalil ; Da Yan ; Virginia Sisiopiku

MACHINE LEARNING FOR PREDICTION OF MID TO LONG TERM HABITUAL

TRANSPORTATION MODE USE ................................................................................................................................ 4520 Alina Lazar ; Alexandra Ballow ; Ling Jin ; C. Anna Spurlock ; Alexander Sim ; Kesheng Wu

UNDERSTANDING DATA SIMILARITY IN LARGE-SCALE SCIENTIFIC DATASETS .................................. 4525 Payton Linton ; William Melodia ; Alina Lazar ; Deborah Agarwal ; Ludovico Bianchi ; Devarshi Ghoshal ;

Gilberto Pastorello ; Lavanya Ramakrishnan ; Kesheng Wu

DETECTING DEPENDENCY BETWEEN DISCRETE RANDOM VARIABLES AND

APPLICATION ............................................................................................................................................................... 4532 Edgar Llamas ; Ivan Garcia ; Andres Mendez

EXPLORATION OF WORKFLOW MANAGEMENT SYSTEMS EMERGING FEATURES FROM

USERS PERSPECTIVES ............................................................................................................................................... 4537 Ryan Mitchell ; Loic Pottier ; Steve Jacobs ; Rafael Ferreira Da Silva ; Mats Rynge ; Karan Vahi ; Ewa Deelman

EMPOWERING AGROECOSYSTEM MODELING WITH HTC SCIENTIFIC WORKFLOWS:

THE CYCLES MODEL USE CASE .............................................................................................................................. 4545 Rafael Ferreira Da Silva ; Rajiv Mayani ; Yuning Shi ; Armen R. Kemanian ; Mats Rynge ; Ewa Deelman

EVALUATING SCIENTIFIC WORKFLOW ENGINES FOR DATA AND COMPUTE INTENSIVE

DISCOVERIES ................................................................................................................................................................ 4553 Rina Singh ; Jeffrey A. Graves ; Valentine Anantharaj ; Sreenivas R. Sukumar

INFORMATION EXTRACTION FROM CANCER PATHOLOGY REPORTS WITH GRAPH

CONVOLUTION NETWORKS FOR NATURAL LANGUAGE TEXTS ................................................................. 4561 Hong-Jun Yoon ; John Gounley ; M. Todd Young ; Georgia Tourassi

HUMAN-MACHINE INFORMATION EXTRACTION SIMULATOR FOR BIOLOGICAL

COLLECTIONS .............................................................................................................................................................. 4565 Icaro Alzuru ; Aditi Malladi ; Andréa Matsunaga ; Maurício Tsugawa ; Fortes José A. B.

WATAPI: COMPOSING WEB API SPECIFICATION FROM API DOCUMENTATIONS

THROUGH AN INTELLIGENT AND INTERACTIVE ANNOTATION TOOL .................................................... 4573 Mehdi Bahrami ; Wei-Peng Chen

ANALYSING SOCIAL MEDIA AS A HYBRID TOOL TO DETECT AND INTERPRET LIKELY

RADICAL BEHAVIOURAL TRAITS FOR NATIONAL SECURITY ..................................................................... 4579 Pedro Cardénas ; Boguslaw Obara ; Georgios Theodoropoulos ; Ibad Kureshi

CLUSTERCLEAN: A WEAK SEMI-SUPERVISED APPROACH FOR CLEANING DATA

LABELS ........................................................................................................................................................................... 4589 Kyriaki Dimitriadou ; Rahul Manghwani ; Timothy C. Hoad

INCENTIVE DESIGN FOR CROWDSOURCED DEVELOPMENT OF SELECTIVE AI FOR

HUMAN AND MACHINE DATA PROCESSING: A CASE STUDY ........................................................................ 4596 Masafumi Hayashi ; Masaki Kobayashi ; Masaki Matsubara ; Toshiyuki Amagasa ; Atsuyuki Morishima

DISAMBIGUATION AND ERROR RESOLUTION IN CALL TRANSCRIPTS..................................................... 4602 Jordan Hosier ; Vijay K. Gurbani ; Neil Milstead

COLLABORATIVE WORKFLOW FOR ANALYZING LARGE-SCALE DATA FOR

ANTIMICROBIAL RESISTANCE: AN EXPERIENCE REPORT ........................................................................... 4608 Pei-Yu Hou ; Jing Ao ; Andrew Rindos ; Shivaramu Keelara ; Paula J. Fedorka-Cray ; Rada Chirkova

ESTABLISHMENT OF WORK-FLOW FOR ROOF DAMAGE DETECTION UTILIZING

DRONES, HUMAN AND AI BASED ON HUMAN-IN-THE-LOOP FRAMEWORK ............................................. 4618 Munenari Inoguchi ; Keiko Tamura ; Ryota Hamamoto

A MICROTASK APPROACH TO IDENTIFYING INCOMPREHENSION FOR FACILITATING

PEER LEARNING .......................................................................................................................................................... 4624 Hinako Izumi ; Masaki Matsubara ; Chiemi Watanabe ; Atsuyuki Morishima

ACTIVE LEARNING WITHOUT UNLABELED SAMPLES: GENERATING QUESTIONS AND

LABELS USING MONTE CARLO TREE SEARCH .................................................................................................. 4628 Sathish K. Sankarpandi ; Spyros Samothrakis ; Luca Citi ; Peter Brady

SUPER LONG INTERVAL TIME-LAPSE IMAGE GENERATION FOR PROACTIVE

PRESERVATION OF CULTURAL HERITAGE USING CROWDSOURCING .................................................... 4632 Hidehiko Shishido ; Hansung Kim ; Itaru Kitahara

EXPLAINABLE RECOMMENDATION USING REVIEW TEXT AND A KNOWLEDGE GRAPH .................. 4638 Takafumi Suzuki ; Satoshi Oyama ; Masahito Kurihara

MISINFORMATION HARMS DURING CRISES: WHEN THE HUMAN AND MACHINE LOOPS

INTERACT ...................................................................................................................................................................... 4644 Thi Tran ; Paul Rad ; Rohit Valecha ; H. Raghav Rao

ACTIVE LEARNING STRATEGIES FOR HIERARCHICAL LABELING MICROTASKS ............................... 4647 Kousuke Uo ; Masaki Kobayashi ; Masaki Matsubara ; Yukino Baba ; Atsuyuki Morishima

EXTRACTING EXPLAINABLE DEEP REPRESENTATION FOR MACHINE TUTORING ............................. 4651 Ming-Chen Wang ; Vahid Golderzahi ; Hsing-Kuo Pao

TOWARDS QUALITY ASSESSMENT OF CROWDWORKER OUTPUT BASED ON

BEHAVIORAL DATA .................................................................................................................................................... 4659 Shigeaki Yuasa ; Takumi Nakai ; Takanori Maruichi ; Manuel Landsmann ; Koichi Kise ; Masaki Matsubara ;

Atsuyuki Morishima

OPEN-SOURCE INNOVATION IN PRACTICE: A LEAN-BASED DEVELOPMENT PROCESS

LEVERAGING OPEN-SOURCE BIG DATA TOOLS ............................................................................................... 4662 Silvio Alonso ; Marx Viana ; Elder Cirilo ; Paulo Alencar ; Carlos Lucena

USING SYNTHETIC DATA GENERATORS TO PROMOTE OPEN SCIENCE IN HIGHER

EDUCATION LEARNING ANALYTICS .................................................................................................................... 4672 Mohsen Dorodchi ; Erfan Al-Hossami ; Aileen Benedict ; Elise Demeter

SANKHYA: AN UNBIASED BENCHMARK FOR BANGLA HANDWRITTEN DIGITS

RECOGNITION .............................................................................................................................................................. 4676 Aminul Islam ; Fuad Rahman ; Akm Shahariar Azad Rabby

USING COLLABORATIVE OPEN SCIENCE TO ADVANCE K-12 COMPUTING EDUCATION .................... 4684 Monica M. McGill

IENVIRONMENT: PERSPECTIVES ON METADATA-ORIENTED TESTING OF RESEARCH

SOFTWARE .................................................................................................................................................................... 4687 Doug Mulholland ; Paulo Alencar ; Donald Cowan

CODE CONVENTION ADHERENCE IN RESEARCH DATA INFRASTRUCTURE SOFTWARE:

AN EXPLORATORY STUDY ....................................................................................................................................... 4691 Michael Smit

VOLENTI NON FIT INJURIA: RANSOMWARE AND ITS VICTIMS ................................................................... 4701 Amir Atapour-Abarghouei ; Stephen Bonner ; Andrew Stephen McGough

IDENTIFYING ANDROID MALWARE FAMILIES USING ANDROID-ORIENTED METRICS ...................... 4708 William Blanc ; Lina G. Hashem ; Karim O. Elish ; M. J. Hussain Almohri

MODELING AND FORECASTING ARMED CONFLICT: AUTOML WITH HUMAN-GUIDED

MACHINE LEARNING ................................................................................................................................................. 4714 Vito D’Orazio ; James Honaker ; Raman Prasady ; Michael Shoemate

ONLINE HATE: A STUDY ON THE FEASIBILITY TO DETECT HATE SPEECH IN SWEDISH ................... 4724 Johan Fernquist ; Oskar Lindholm ; Lisa Kaati ; Nazar Akrami

CLASS BALANCING FOR FRAUD DETECTION IN POINT OF SALE SYSTEMS ............................................ 4730 Christine Hines ; Abdou Youssef

A LOCATION INDEPENDENT MACHINE LEARNING APPROACH FOR EARLY FAKE NEWS

DETECTION ................................................................................................................................................................... 4740 Haohui Liu

IOCMINER: AUTOMATIC EXTRACTION OF INDICATORS OF COMPROMISE FROM

TWITTER ........................................................................................................................................................................ 4747 Amirreza Niakanlahiji ; Lida Safarnejad ; Reginald Harper ; Bei-Tseng Chu

PRAT - A TOOL FOR ASSESSING RISK IN WRITTEN COMMUNICATION .................................................... 4755 Amendra Shrestha ; Lisa Kaati ; Nazar Akrami

NETWORK EMBEDDING: ON COMPRESSION AND LEARNING ...................................................................... 4763 Esra Akbas ; Mehmet Emin Aktas

DISTRIBUTED COMMUNITY DETECTION IN LARGE NETWORKS USING AN

INFORMATION-THEORETIC APPROACH ............................................................................................................. 4773 Md Abdul Motaleb Faysal ; Shaikh Arifuzzaman

A SCALABLE GRAPH ANALYTICS FRAMEWORK FOR PROGRAMMING WITH BIG DATA

IN R (PBDR) .................................................................................................................................................................... 4783 S. M. Shamimul Hasan ; Drew Schmidt ; Ramakrishnan Kannan ; Neena Imam

EFFICIENT SIMILARITY-BASED ALIGNMENT OF TEMPORALLY-SITUATED GRAPH

NODES WITH APACHE SPARK ................................................................................................................................. 4793 Hubert Naacke ; Ke Li ; Bernd Amann ; Olivier Curé

GRAPHEVO: CHARACTERIZING AND UNDERSTANDING SOFTWARE EVOLUTION USING

CALL GRAPHS ............................................................................................................................................................... 4799 Vijay Walunj ; Gharib Gharibi ; Duy H. Ho ; Yugyung Lee

COMPUTING COMPLEX GRAPH PROPERTIES WITH SQL QUERIES ............................................................ 4808 Xiantian Zhou ; Carlos Ordonez

PREDICTING POST-STROKE HOSPITAL DISCHARGE DISPOSITION USING

INTERPRETABLE MACHINE LEARNING APPROACHES .................................................................................. 4817 Jin Cho ; Alnour Alharin ; Zhen Hu ; Nancy Fell ; Mina Sartipi

RECURRENT NEURAL NETWORK BASED FEATURE SELECTION FOR HIGH

DIMENSIONAL AND LOW SAMPLE SIZE MICRO-ARRAY DATA .................................................................... 4823 Shanta Chowdhury ; Xishuang Dong ; Xiangfang Li

REINFORCEMENT LEARNING FRAMEWORK TO IDENTIFY CAUSE OF DISEASES -

PREDICTING ASTHMA ATTACK CASE .................................................................................................................. 4829 Quan Do ; Son Tran ; Alexa Doig

COMPUTER-AIDED CLINICAL SKIN DISEASE DIAGNOSIS USING CNN AND OBJECT

DETECTION MODELS ................................................................................................................................................. 4839 Xin He ; Shihao Wang ; Shaohuai Shi ; Zhenheng Tang ; Yuxin Wang ; Zhihao Zhao ; Jing Dai ; Ronghao Ni ;

Xiaofeng Zhang ; Xiaoming Liu ; Zhili Wu ; Wu Yu ; Xiaowen Chu

AUTOMATED MACHINE LEARNING FOR EEG-BASED CLASSIFICATION OF PARKINSON’S

DISEASE PATIENTS ..................................................................................................................................................... 4845 Milan Koch ; Victor Geraedts ; Hao Wang ; Martijn Tannemaat ; Thomas Bäck

TOWARDS EXPLAINABLE MELANOMA DIAGNOSIS: PREDICTION OF CLINICAL

INDICATORS USING SEMI-SUPERVISED AND MULTI-TASK LEARNING ..................................................... 4853 Seiya Murabayashi ; Hitoshi Iyatomi

STOCHASTIC GASTRIC IMAGE AUGMENTATION FOR CANCER DETECTION FROM X-

RAY IMAGES ................................................................................................................................................................. 4858 Hideaki Okamoto ; Quan Huu Cap ; Takakiyo Nomura ; Hitoshi Iyatomi ; Jun Hashimoto

EXPLOITING ANTI-MONOTONIC CONSTRAINTS IN MINING PALINDROMIC MOTIFS

FROM BIG GENOMIC DATA ...................................................................................................................................... 4864 Oluwafemi A. Sarumi ; Carson K. Leung

BAYESIAN NON-LINEAR SUPPORT VECTOR MACHINE FOR HIGH-DIMENSIONAL DATA

WITH INCORPORATION OF GRAPH INFORMATION ON FEATURES ........................................................... 4874 Wenli Sun ; Changgee Chang ; Qi Long

EXPLAINABLE DEEP LEARNING APPLIED TO UNDERSTANDING OPIOID USE DISORDER

AND ITS RISK FACTORS ............................................................................................................................................. 4883 T. Elizabeth Workman ; Yijun Shao ; Joel Kupersmith ; Friedhelm Sandbrink ; Joseph L. Goulet ; Nawar M. Shaar

; Christopher Spevak ; Cynthia Brandt ; Marc R. Blackman ; Qing Zeng-Treitler

DISCOVERING SUBLANGUAGES IN A LARGE CLINICAL CORPUS THROUGH

UNSUPERVISED MACHINE LEARNING AND INFORMATION GAIN .............................................................. 4889 T. Elizabeth Workman ; Guy Divita ; Qing Zeng-Treitler

NOISY SUBGRAPH ISOMORPHISMS ON MULTIPLEX NETWORKS ............................................................... 4899 Hui Jin ; Xie He ; Yanghui Wang ; Hao Li ; Andrea L. Bertozzi

FILTERING STRATEGIES FOR INEXACT SUBGRAPH MATCHING ON NOISY MULTIPLEX

NETWORKS .................................................................................................................................................................... 4906 Alexei Kopylov ; Jiejun Xu

APPLICATIONS OF STRUCTURAL EQUIVALENCE TO SUBGRAPH ISOMORPHISM ON

MULTICHANNEL MULTIGRAPHS ........................................................................................................................... 4913 Thien Nguyen ; Dominic Yang ; Yurun Ge ; Hao Li ; Andrea L. Bertozzi

MULTIPLEX GRAPH MATCHING MATCHED FILTERS ..................................................................................... 4921 Konstantinos Pantazis ; Daniel L. Sussman ; Youngser Park ; Carey E. Priebe ; Vince Lyzinski

GRAPH GENERATION WITH A FOCUSING LEXICON ........................................................................................ 4928 Mayanka Chandra Shekar ; Joseph A. Cottam

TOWARD FILAMENT SEGMENTATION USING DEEP NEURAL NETWORKS .............................................. 4932 Azim Ahmadzadeh ; Sushant S. Mahajan ; Dustin J. Kempton ; Rafal A. Angryk ; Shihao Ji

SOLAR EVENT TRACKING WITH DEEP REGRESSION NETWORKS: A PROOF OF

CONCEPT EVALUATION ............................................................................................................................................ 4942 Toqi Tahamid Sarker ; Juan M. Banda

AN APPLICATION OF SPATIO-TEMPORAL CO-OCCURRENCE ANALYSES FOR

INTEGRATING SOLAR ACTIVE REGION DATA FROM MULTIPLE REPORTING MODULES ................. 4950 Xumin Cai ; Berkay Aydin ; Manolis K. Georgoulis ; Rafal Angryk

UNDERSTANDING THE IMPACT OF STATISTICAL TIME SERIES FEATURES FOR FLARE

PREDICTION ANALYSIS ............................................................................................................................................. 4960 Maxwell Hostetter ; Azim Ahmadzadeh ; Berkay Aydin ; Manolis K. Georgoulis ; Dustin J. Kempton ; Rafal A.

Angryk

SOLAR PRE-FLARE CLASSIFICATION WITH TIME SERIES PROFILING ..................................................... 4967 Ruizhe Ma ; Azim Ahmadzadeh ; Soukaina Filali Boubrahimi ; Manolis K. Georgoulis ; Rafal A. Angryk

STREAMING TEMPORAL GRAPHS: SUBGRAPH MATCHING .......................................................................... 4977 Eric L. Goodman ; Dirk Grunwald

CONSIDERING THE BLACKBOX: AN INVESTIGATION OF OPTIMIZATION TECHNIQUES

WITH COMPLETELY BALANCED DATASETS OF PACKET TRAFFIC ........................................................... 4987 Bruce Hartpence ; Andres Kwasinski

ARE WE REALLY PROTECTED? AN INVESTIGATION INTO THE PLAY PROTECT SERVICE ............... 4997 Shinelle Hutchinson ; Bing Zhou ; Umit Karabiyik

INTELLIGENT FEATURE ENGINEERING FOR CYBERSECURITY ................................................................. 5005 Paul Maxwell ; Elie Alhajjar ; Nathaniel D. Bastian

AN EXTREMELY LIGHTWEIGHT APPROACH FOR DDOS DETECTION AT HOME

GATEWAYS .................................................................................................................................................................... 5012 Gabriel Mendonça ; Gustavo H. A. Santos ; Edmundo De Souza E Silva ; Rosa M. M. Leão ; Daniel S. Menasché ;

Don Towsley

THE EFFECTIVENESS OF EDGE CENTRALITY MEASURES FOR ANOMALY DETECTION..................... 5022 Candice Mitchell ; Rajeev Agrawal ; Joshua Parker

SECP2I: A SECURE MULTI-PARTY DISCOVERY OF PERSONALLY IDENTIFIABLE

INFORMATION (PII) IN STRUCTURED AND SEMI-STRUCTURED DATASETS ............................................ 5028 Amine Mrabet ; Mehdi Bentounsi ; Patrice Darmon

EXTRACTING RICH SEMANTIC INFORMATION ABOUT CYBERSECURITY EVENTS ............................. 5034 Taneeya Satyapanich ; Tim Finin ; Francis Ferraro

RNR: EXTRACTION OF VISUAL ATTRIBUTES FROM LARGE-SCALE FASHION DATASET ................... 5043 Sungjae Lee ; Yeonji Lee ; Junho Kim ; Kyungyong Lee

MULTI-VIEW SUBSPACE CLUSTERING BASED ON TENSOR SCHATTEN-P NORM .................................. 5048 Yongli Liu ; Xiaoqin Zhang ; Guiying Tang ; Di Wang

STRUCTURAL DICTIONARY LEARNING BASED ON NON-CONVEX SURROGATE OF L2,1

NORM FOR CLASSIFICATION .................................................................................................................................. 5056 Xiaoju Lu ; Guiying Tang ; Di Wang ; Xiaoqin Zhang ; Jingjing Zheng

SINGLE IMAGE DEHAZING VIA LIGHTWEIGHT MULTI-SCALE NETWORKS ........................................... 5062 Guiying Tang ; Li Zhao ; Runhua Jiang ; Xiaoqin Zhang

SEMANTIC CORRELATIONS LOSS: IMPROVING MODEL INTERPRETABILITY FOR

MULTI-CLASS CLASSIFICATION ............................................................................................................................. 5070 Xuezhi Tong ; Rui Wang ; Xiaochun Cao ; Wenqi Ren

SINGLE-IMAGE DEHAZING USING COLOR ATTENUATION PRIOR BASED ON HAZE-LINES ............... 5080 Qianru Wang ; Li Zhao ; Guiying Tang ; Hanli Zhao ; Xiaoqin Zhang

AUTOMATIC LANDMARK PLACEMENT FOR LARGE 3D FACIAL IMAGE DATASET .............................. 5088 Jerry Wang ; Shiaofen Fang ; Meie Fang ; Jeremy Wilson ; Noah Herrick ; Susan Walsh

ROBUST BASKETBALL PLAYER TRACKING BASED ON A HYBRID DETECTION

GROUPING FRAMEWORK FOR OVERLAPPING CAMERAS ............................................................................ 5094 Kuan-Hsien Wu ; Wan-Lun Tsai ; Tse-Yu Pan ; Min-Chun Hu

BANKNOTES SERIAL NUMBER CODING RECOGNITION ................................................................................. 5101 Ruru Xu ; Jungang An ; Liandeng Su ; Xinli Min

SCALABLE DOCUMENT IMAGE INFORMATION EXTRACTION WITH APPLICATION TO

DOMAIN-SPECIFIC ANALYSIS ................................................................................................................................. 5108 Yingbin Zheng ; Shuchen Kong ; Wanshan Zhu ; Hao Ye

WEIGHTED FOCUS-ATTENTION DEEP NETWORK FOR FINE-GRAINED IMAGE

CLASSIFICATION ......................................................................................................................................................... 5116 Cong Zou ; Rui Wang ; Xiaochun Cao ; Feixiao Lv

EXPLORING DIETARY INTAKE DATA COLLECTED BY FPQ USING UNSUPERVISED

LEARNING ...................................................................................................................................................................... 5126 Martin Gjoreski ; Stefan Kochev ; Nina Rešcic ; Matej Gregoric ; Tome Eftimov ; Barbara Koroušic Seljak

COMPARING SEMANTIC AND NUTRIENT VALUE SIMILARITIES OF RECIPES ........................................ 5131 Gordana Ispirova ; Tome Eftimov ; Barbara Koroušic Seljak

OPTIMIZATION OF ARABLE LAND USE TOWARDS MEAT-FREE AND CLIMATE-SMART

AGRICULTURE: A CASE STUDY IN FOOD SELF-SUFFICIENCY OF VIETNAM ........................................... 5140 Vladimir Kuzmanovski ; Daniel Ellehammer Larsen ; Christian Bugge Henriksen

USING DEEP LEARNING FOR FOOD AND BEVERAGE IMAGE RECOGNITION .......................................... 5149 Simon Mezgec ; Barbara Koroušic Seljak

THE STATE AND FUTURE OF SMART AGRICULTURE: INSIGHTS FROM MINING SOCIAL

MEDIA ............................................................................................................................................................................. 5152 Martinson Ofori ; Omar El-Gayar

PROMOTING METROLOGY IN FOOD AND NUTRITION: A POSITION PAPER ON

METROFOOD-RI AND ITS E-COMPONENT ........................................................................................................... 5162 Nives Ogrinc ; Barbara Koroušic Seljak ; Karl Presser ; Marga Ocke ; Iulia Iatco ; Claudia Zoani

QUALIFOOD: AN INTELLIGENT QUALITY FOOD EVALUATION USING LOGICAL

SATISFIABILITY REASONING ON SPARK ............................................................................................................. 5165 Salima Benbernou ; Mourad Ouziri

EXPLORING A STANDARDIZED LANGUAGE FOR DESCRIBING FOODS USING

EMBEDDING TECHNIQUES ....................................................................................................................................... 5172 Gorjan Popovski ; Bibek Paudel ; Tome Eftimov ; Barbara Koroušic Seljak

AOP: AN ANTI-OVERFITTING PRETREATMENT FOR PRACTICAL IMAGE-BASED PLANT

DIAGNOSIS ..................................................................................................................................................................... 5177 Takumi Saikawa ; Quan Huu Cap ; Satoshi Kagiwada ; Hiroyuki Uga ; Hitoshi Iyatomi

INFRASTRUCTURE FOR INNOVATIVE RESEARCH ON HEALTHY FOOD CHOICE,

PREPARATION AND CONSUMPTION: A POSITION PAPER ON THE RICHFIELDS PROJECT ................. 5183 Barbara Koroušic Seljak ; Krijn Popp ; Paul Finglas ; Lada Timotijevic ; Pieter Van’T Vee ; Karin Zimmerman

SEMI-AUTOMATIC CROWDSOURCING TOOL FOR ONLINE FOOD IMAGE COLLECTION

AND ANNOTATION ...................................................................................................................................................... 5186 Zeman Shao ; Runyu Mao ; Fengqing Zhu

FOOD WASTE ONTOLOGY: A FORMAL DESCRIPTION OF KNOWLEDGE FROM THE

DOMAIN OF FOOD WASTE ........................................................................................................................................ 5190 Riste Stojanov ; Tome Eftimov ; Hannah Pinchen ; Maria Traka ; Paul Finglas ; Drago Torkar ; Barbara

Koroušic Seljak

A COMPARABLE STUDY: INTRINSIC DIFFICULTIES OF PRACTICAL PLANT DIAGNOSIS

FROM WIDE-ANGLE IMAGES ................................................................................................................................... 5195 Katsumasa Suwa ; Quan Huu Cap ; Ryunosuke Kotani ; Hiroyuki Uga ; Satoshi Kagiwada ; Hitoshi Iyatomi

FROM DIKW PYRAMID TO GRAPH DATABASE: A TOOL FOR MACHINE PROCESSING OF

NUTRITIONAL EPIDEMIOLOGIC RESEARCH DATA ......................................................................................... 5202 Chen Yang ; Bernard De Baets ; Carl Lachat

IOT-BASED MULTI-VIEW MACHINE VISION SYSTEMS .................................................................................... 5206 Emmanuel A. Castillo ; Ali Ahmadinia

BLOCKCHAIN AND IOT FOR DELIVERY ASSURANCE ON SUPPLY CHAIN (BIDAS) ................................ 5213 Mehmet Demir ; Ozgur Turetken ; Alexander Ferwom

A MICROSERVICES PLATFORM FOR MONITORING AND ANALYSIS OF IOT TRAFFIC

DATA IN SMART CITIES ............................................................................................................................................. 5223 Antonio De Iasio ; Angelo Furno ; Lorenzo Goglia ; Eugenio Zimeo

IBFRAME: IOT DATA PROCESSING FRAMEWORK FOR INTELLIGENT BUILDING

MANAGEMENT ............................................................................................................................................................. 5233 Dongwoo Kwon ; Kisu Ok ; Youngmin Ji

CENTRALIZED TRUST SCHEME FOR CLUSTER ROUTING OF WIRELESS SENSOR

NETWORKS .................................................................................................................................................................... 5239 Yunfan Li ; Nesrine Berjab ; Hieu Hanh Le ; Haruo Yokota

DETECTING DOS ATTACK IN SMART HOME IOT DEVICES USING A GRAPH-BASED

APPROACH ..................................................................................................................................................................... 5249 Ramesh Paudel ; Timothy Muncy ; William Eberle

RF-MSIP: RADIO FREQUENCY MULTI-SOURCE INDOOR POSITIONING .................................................... 5259 Vishal Perekadan ; Tathagata Mukherjee ; Chaity Banerjee ; Eduardo Pasiliao

NON-INTRUSIVE BEHAVIOR AWARENESS FOR RESIDENTS OF A SMART HOUSE ................................. 5269 Guillermo Ramirez-Prado ; Bashar Barmada ; Veronica Liesaputra

RESILIENT ACTIVITIES TRACKING IN A SMART HOME USING ULTRASONIC SENSORS ..................... 5274 Kashyap Venkatesh ; Bashar Barmada ; Veronica Liesaputra ; Guillermo Ramirez-Prado

PRIVACY IN IOT BLOCKCHAINS: WITH BIG DATA COMES BIG RESPONSIBILITY ................................ 5282 Steven A. Wright

STOCK INDEX FORECASTING BY HIDDEN MARKOV MODELS WITH TRENDS

RECOGNITION .............................................................................................................................................................. 5292 Xiaoning Cui ; Wei Shang ; Fuxin Jiang ; Shouyang Wang

A CONVERSATIONAL USER INTERFACE FOR STOCK ANALYSIS ................................................................. 5298 Paula Lauren ; Paul Watta

WHAT MAKE A NETWORK NOVEL POPULAR? IMPLICATIONS FOR “QIDIAN.COM” ............................ 5306 Lu Liu ; Liangqiang Li ; Miyan Liao ; Liang Yang ; You Li

STUDY ON THE RELATIONSHIP BETWEEN HOUSE RENT AND PEOPLE CONGESTION BY

TIME IN TOKYO BASED ON MOBILE PHONE GPS DATA ................................................................................. 5313 Yinglan Qin ; Yuki Akiyama ; Yoshiki Ogawa ; Ryosuke Shibasaki ; Taisei Sato

INVENTORY COST CONTROL MODEL FOR FRESH PRODUCT RETAILERS BASED ON DQN ................ 5321 Ruoying Sun ; Peng Sun ; Jiahui Li ; Gang Zhao

NATURAL GAS PRICE PREDICTION WITH BIG DATA ...................................................................................... 5326 Yuanyuan Tang ; Qingmei Wang ; Wei Xu ; Mingming Wang ; Zhaowei Wang

THE RESEARCH ON CROSS-BORDER ONLINE SHOPPING TRANSACTION RISK BASED ON

ONLINE DATA ACCESS ............................................................................................................................................... 5331 Wang Cui ; Wang Xingfen ; Zhuang Wenying

TEMPORAL NEIGHBOURHOOD AGGREGATION: PREDICTING FUTURE LINKS IN

TEMPORAL GRAPHS VIA RECURRENT VARIATIONAL GRAPH CONVOLUTIONS .................................. 5336 Stephen Bonner ; Amir Atapour-Abarghouei ; Philip T Jackson ; John Brennan ; Ibad Kureshi ; Georgios

Theodoropoulos ; Andrew Stephen McGough ; Boguslaw Obara

CHARACTERIZATION AND GRAPH EMBEDDING OF WEIGHTED SOCIAL NETWORKS

THROUGH DIFFUSION WAVELETS ........................................................................................................................ 5346 Zhiliang Chen ; Junfeng Wu ; Huakang Li ; Guozi Sun

OPTIMIZING VARIATIONAL GRAPH AUTOENCODER FOR COMMUNITY DETECTION ........................ 5353 Jun Jin Choong ; Xin Liu ; Tsuyoshi Murata

PREDICTING ALZHEIMER’S DISEASE BY HIERARCHICAL GRAPH CONVOLUTION FROM

POSITRON EMISSION TOMOGRAPHY IMAGING ............................................................................................... 5359 Jiaming Guo ; Wei Qiu ; Xiang Li ; Xuandong Zhao ; Ning Guo ; Quanzheng Li

LEARNING RELEVANT MOLECULAR REPRESENTATIONS VIA SELF-ATTENTIVE GRAPH

NEURAL NETWORKS .................................................................................................................................................. 5364 Shoma Kikuchi ; Ichigaku Takigawa ; Satoshi Oyama ; Masahito Kurihara

A DYNAMIC FINANCIAL KNOWLEDGE GRAPH BASED ON REINFORCEMENT LEARNING

AND TRANSFER LEARNING ...................................................................................................................................... 5370 Rui Miao ; Xia Zhang ; Hongfei Yan ; Chong Chen

EXPLOITING GRAPH CONVOLUTIONAL NETWORKS FOR REPRESENTATION LEARNING

OF MOBILE APP USAGE ............................................................................................................................................. 5379 Keiichi Ochiai ; Naoki Yamamoto ; Takashi Hamatani ; Yusuke Fukazawa ; Takayasu Yamaguchi

GRAPH CLASSIFICATION WITH THE HYPERNETWORK, A MOLECULE INTERACTION

BASED EVOLUTIONARY ARCHITECTURE ........................................................................................................... 5384 Jose Segovia-Juarez ; Silvano Colombano ; Alex Flores-Mamani ; Daniel Hidalgo-Chavez ; Miguel Mejia-Puma

IMPROVED DEEP EMBEDDINGS FOR INFERENCING WITH MULTI-LAYERED GRAPHS ....................... 5394 Huan Song ; Jayaraman J. Thiagarajan

HETEROGENEOUS GRAPH MATCHING NETWORKS: APPLICATION TO UNKNOWN

MALWARE DETECTION ............................................................................................................................................. 5401 Shen Wang ; Philip S. Yu

TWO-STAGE FRAMEWORK FOR BIG SPATIAL DATA ANALYTICS TO SUPPORT DISASTER

RESPONSE ...................................................................................................................................................................... 5409 Xuan Hu ; Jie Gong ; Eduard Gibert Renard ; Manish Parashar

A COMPARISON OF DEEP LEARNING VEHICLE GROUP DETECTION IN SATELLITE

IMAGERY ....................................................................................................................................................................... 5419 J. Alex Hurt ; David Huangal ; Curt H. Davis ; Grant J. Scott

DECISION-LEVEL FUSION OF DNN OUTPUTS FOR IMPROVING FEATURE DETECTION

PERFORMANCE ON LARGE-SCALE REMOTE SENSING IMAGE DATASETS .............................................. 5428 Alan B. Cannaday II ; Raymond L. Chastain ; J. Alex Hurt ; Curt H. Davis ; Grant J. Scott ; A. J. Maltenfort

SCALING DEEP LEARNING-BASED ANALYSIS OF HIGH-RESOLUTION SATELLITE

IMAGERY WITH DISTRIBUTED PROCESSING .................................................................................................... 5437 Mai H. Nguyen ; Jiaxin Li ; Daniel Crawl ; Jessica Block ; Ilkay Altintas

NOVEL DEEP-LEARNING-BASED SPATIAL-SPECTRAL FEATURE EXTRACTION FOR

HYPERSPECTRAL REMOTE SENSING APPLICATIONS .................................................................................... 5444 Bishwas Praveen ; Vineetha Menon

REMOTE SENSING OBJECT LOCALIZATION WITH DEEP HETEROGENEOUS SUPERPIXEL

FEATURES ...................................................................................................................................................................... 5453 Alex Yang ; J. Alex Hurt ; Charlie T. Veal ; Grant J. Scott

SMART TRACKING OF INTERNAL LAYERS OF ICE IN RADAR DATA VIA MULTI-SCALE

LEARNING ...................................................................................................................................................................... 5462 Masoud Yari ; Maryam Rahnemoonfar ; John Paden ; Ibikunle Oluwanisola ; Lora Koenig ; Lynn Montgomery

ASYNCHRONOUS BLOCKCHAIN-BASED PRIVACY-PRESERVING TRAINING

FRAMEWORK FOR DISEASE DIAGNOSIS ............................................................................................................. 5469 Xuhui Chen ; Xufei Wang ; Kun Yang

PRIVACY-PRESERVING STATISTICAL ANALYSIS OF HEALTH DATA USING PAILLIER

HOMOMORPHIC ENCRYPTION AND PERMISSIONED BLOCKCHAIN .......................................................... 5474 Mahdi Ghadamyari ; Saeed Samet

STRUCTURING TIME SERIES DATA TO GAIN INSIGHT INTO AGENT BEHAVIOUR ................................ 5480 Najim Al-Baghdadi ; Wojciech Wisniewski ; David Lindsay ; Siân Lindsay ; Yuri Kalnishkan ; Chris Watkins

DIMENSLON ESTIMATION OF EQUITY MARKETS ............................................................................................ 5491 Nitish Bahadur ; Randy Paffenroth ; Kelum Gajamannage

EVALUATING SENTIMENT CLASSIFIERS FOR BITCOIN TWEETS IN PRICE PREDICTION

TASK ................................................................................................................................................................................ 5499 Ahmed M. Balfagih ; Vlado Keselj

PREDICTING THE DAILY NUMBER OF PAYMENT TRANSACTIONS IN THE LARGEST

BANK IN THE NETHERLANDS: APPLICATION TO BANKING DATA ............................................................. 5507 Maartje Corstjens ; Marzieh Bakhshandeh ; Pinar Kahraman ; Joost Bosman

DEEP LEARNING FOR THE PREDICTION OF STOCK MARKET TRENDS .................................................... 5513 Arvand Fazeli ; Sheridan Houghten

COSTOCK: A DEEPFM MODEL FOR STOCK MARKET PREDICTION WITH ATTENTIONAL

EMBEDDINGS ................................................................................................................................................................ 5522 Jieyun Huang ; Xi Zhang ; Binxing Fang

A SEMI-SUPERVISED APPROACH FOR IDENTIFICATION OF THE SECTIONS IN CHARGE

OF RFQ DOCUMENTS .................................................................................................................................................. 5532 Izumo Hidetaka ; Yiou Wang

PEER FIRM IDENTIFICATION USING WORD EMBEDDINGS ........................................................................... 5536 Taeyoung Kee

KRYPTOORACLE: A REAL-TIME CRYPTOCURRENCY PRICE PREDICTION PLATFORM

USING TWITTER SENTIMENTS ................................................................................................................................ 5544 Shubhankar Mohapatra ; Nauman Ahmed ; Paulo Alencar

DEAL OR NO DEAL: PREDICTING MERGERS AND ACQUISITIONS AT SCALE ......................................... 5552 Ryan Moriarty ; Howard Ly ; Ellie Lan ; Suzanne K. McIntosh

FINDX: A VERSATILE, LOW-RESOURCE APPROACH TO FINANCIAL WEBSITE

CLASSIFICATION ......................................................................................................................................................... 5559 Alissa Ostapenko ; Rodica Neamtu ; Frazer Anderson

A FRAMEWORK OF APPLYING KELLY STATIONARY INDEX TO STOCK TRADING IN

TAIWAN MARKET ........................................................................................................................................................ 5569 Jia-Hao Syu ; Mu-En Wu ; Jan-Ming Ho

STOCK PREDICTION USING DEEP LEARNING AND SENTIMENT ANALYSIS ............................................. 5573 Yichuan Xu ; Vlado Keselj

DEEP LEARNING APPROACHES FOR SENTIMENT ANALYSIS ON FINANCIAL

MICROBLOG DATASET .............................................................................................................................................. 5581 Savas Yildirim ; Dhanya Jothimani ; Can Kavaklioglu ; Ayse Basar

MULTI-VIEW, GENERATIVE, TRANSFER LEARNING FOR DISTRIBUTED TIME SERIES

CLASSIFICATION ......................................................................................................................................................... 5585 Sreyasee Das Bhattacharjee ; William J. Tolone ; Ashish Mahabal ; Mohammed Elshambakey ; Isaac Cho ;

Abdullah Al-Raihan Nayeem ; Junsong Yuan ; George Djorgovski

ON ONLINE HATE SPEECH DETECTION: EFFECTS OF NEGATED DATA CONSTRUCTION .................. 5595 Cheniki Abderrouaf ; Mourad Oussalah

DC2: A DIVIDE-AND-CONQUER ALGORITHM FOR LARGE-SCALE KERNEL LEARNING

WITH APPLICATION TO CLUSTERING ................................................................................................................. 5603 Ke Alexander Wang ; Xinran Bian ; Pan Liu ; Donghui Yan

ADVERTISER-ASSISTED BEHAVIORAL AD-TARGETING VIA DENOISED DISTRIBUTION

INDUCTION .................................................................................................................................................................... 5611 Kei Yonekawa ; Hao Niu ; Mori Kurokawa ; Arei Kobayashi ; Daichi Amagata ; Takuya Maekawa ; Takahiro

Hara

DECODER TRANSFER LEARNING FOR PREDICTING PERSONAL EXPOSURE TO AIR

POLLUTION ................................................................................................................................................................... 5620 Peijiang Zhao ; Koji Zettsu

LEARNING AND RECOGNIZING ARCHEOLOGICAL FEATURES FROM LIDAR DATA ............................ 5630 Conrad M Albrecht ; Chris Fisher ; Marcus Freitag ; Hendrik F Hamann ; Sharathchandra Pankanti ; Florencia

Pezzutti ; Francesca Rossi

N-DIMENSIONAL GEOSPATIAL DATA AND ANALYTICS FOR CRITICAL

INFRASTRUCTURE RISK ASSESSMENT ................................................................................................................ 5637 Levente J Klein ; Conrad M Albrecht ; Wang Zhou ; Carlo Siebenschuh ; Sharathchandra Pankanti ; Hendrik F

Hamann ; Siyuan Lu

EFFICIENT LIDAR POINT CLOUD DATA ENCODING FOR SCALABLE DATA

MANAGEMENT WITHIN THE HADOOP ECO-SYSTEM ...................................................................................... 5644 Anh Vu Vo ; Chamin Nalinda Lokugam Hewage ; Gianmarco Russo ; Neel Chauhan ; Debra F. Laefer ; Michela

Bertolotto ; Nhien-An Le-Khac ; Ulrich Oftendinger

DEFENSE-POINTNET: PROTECTING POINTNET AGAINST ADVERSARIAL ATTACKS ........................... 5654 Yu Zhang ; Gongbo Liang ; Tawfiq Salem ; Nathan Jacobs

STREAMING MACHINE LEARNING ALGORITHMS WITH BIG DATA SYSTEMS ....................................... 5661 Vibhatha Abeykoon ; Supun Kamburugamuve ; Kannan Govindrarajan ; Pulasthi Wickramasinghe ; Chathura

Widanage ; Niranda Perera ; Ahmet Uyar ; Gurhan Gunduz ; Selahattin Akkas ; Gregor Von Laszewski

A FAST VIDEO IMAGE DETECTION USING TENSORFLOW MOBILE NETWORKS FOR

RACING CARS ............................................................................................................................................................... 5667 Selahattin Akkas ; Sahaj Singh Maini ; Judy Qiu

DEEPLITE: REAL-TIME DEEP LEARNING FRAMEWORK FOR NEIGHBORHOOD

ANALYSIS ....................................................................................................................................................................... 5673 Duy H. Ho ; Raj Marri ; Sirisha Rella ; Yugyung Lee

BENCHMARKING DEEP LEARNING FOR TIME SERIES: CHALLENGES AND DIRECTIONS .................. 5679 Xinyuan Huang ; Geoffrey C. Fox ; Sergey Serebryakov ; Ankur Mohan ; Pawel Morkisz ; Debojyoti Dutta

MATRICS: A SYSTEM FOR HUMAN-MACHINE HYBRID FORECASTING OF

GEOPOLITICAL EVENTS ........................................................................................................................................... 5683 David J. Huber ; Nigel Stepp ; Aruna Jammalamadaka ; Tiffany Kim ; Sam Johnson ; Dana Warmsley ; Tsai-

Ching Lu

PERFORMANCE CHARACTERIZATION AND MODELING OF SERVERLESS AND HPC

STREAMING APPLICATIONS .................................................................................................................................... 5688 Andre Luckow ; Shantenu Jha

ADAPTIVE HOEFFDING TREE WITH TRANSFER LEARNING FOR STREAMING

SYNCHROPHASOR DATA SETS ................................................................................................................................ 5697 Zakaria El Mrabet ; Daisy Flora Selvaraj ; Prakash Ranganathan

ADVERSARIAL TRAINING FOR PRIVACY-PRESERVING DEEP LEARNING MODEL

DISTRIBUTION .............................................................................................................................................................. 5705 Mohammed Alawad ; Shang Gao ; Xiao-Cheng Wu ; Eric B. Durbin ; Linda Coyle ; Lynne Penberthy ; Georgia

Tourassi

ANONYMOUS PRIVACY-PRESERVING SCHEME FOR BIG DATA OVER THE CLOUD .............................. 5711 Zeyad A. Al-Odat ; Samee U. Khan

DISTRIBUTED CONSENSUS REDUCED SUPPORT VECTOR MACHINE ......................................................... 5718 Hsiang-Hsuan Chen ; Yuh-Jye Lee

DEEP NEURAL NETWORKS AS SIMILITUDE MODELS FOR SHARING BIG DATA .................................... 5728 Philip Derbeko ; Shlomi Dolev ; Ehud Gudes

PRIVACY AND SECURITY OF BIG DATA IN AI SYSTEMS: A RESEARCH AND STANDARDS

PERSPECTIVE ............................................................................................................................................................... 5737 Saharnaz Dilmaghani ; Matthias R. Brust ; Grégoire Danoy ; Natalia Cassagnes ; Johnatan Pecero ; Pascal

Bouvry

AN IDENTITY PRIVACY PRESERVING IOT DATA PROTECTION SCHEME FOR CLOUD

BASED ANALYTICS ...................................................................................................................................................... 5744 Christian Gehrmann ; Martin Gunnarsson

KRATOS: A SECURE, AUTHENTICATED AND PUBLICLY VERIFIABLE SYSTEM FOR

EDUCATIONAL DATA USING THE BLOCKCHAIN .............................................................................................. 5754 Velislava Hillman ; Varunram Ganesh

UTILITY AND PRIVACY ASSESSMENTS OF SYNTHETIC DATA FOR REGRESSION TASKS ................... 5763 Markus Hittmeir ; Andreas Ekelhart ; Rudolf Mayer

RIBS: RISKY BLIND-SPOTS FOR ATTACK CLASSIFICATION MODELS ....................................................... 5773 Mikel Joaristi ; Arthur Putnam ; Alfredo Cuzzocrea ; Edoardo Serra

THE ANTI-DATA-MINING (ADM) FRAMEWORK - BETTER PRIVACY ON ONLINE SOCIAL

NETWORKS AND BEYOND ........................................................................................................................................ 5780 Shah Mahmood

DROID-NNET: DEEP LEARNING NEURAL NETWORK FOR ANDROID MALWARE

DETECTION ................................................................................................................................................................... 5789 Mohammad Masum ; Hossain Shahriar

PRIVACY-PRESERVING TOP-K DOMINATING QUERIES IN DISTRIBUTED MULTI-PARTY

DATABASES ................................................................................................................................................................... 5794 Mahboob Qaosar ; Kazi Md. Rokibul Alam ; Chen Li ; Yasuhiko Morimoto

IMPROVING K-NEAREST NEIGHBOR PATTERN RECOGNITION MODELS FOR PRIVACY-

PRESERVING DATA ANALYSIS ................................................................................................................................ 5804 Walisa Romsaiyud ; Henning Schnoor ; Wilhelm Hasselbring

EXPERIENTIAL LEARNING: CASE STUDY-BASED PORTABLE HANDS-ON REGRESSION

LABWARE FOR CYBER FRAUD PREDICTION ..................................................................................................... 5814 Hossain Shahriar ; Michael Whitman ; Dan Chia-Tien Lo ; Fan Wu ; Cassandra Thomas ; Alfredo Cuzzocrea

LOCAL DIFFERENTIAL PRIVACY WITH K-ANONYMOUS FOR FREQUENCY ESTIMATION ................. 5819 Dan Zhao ; Hong Chen ; Suyun Zhao ; Xiaoying Zhang ; Cuiping Li ; Ruixuan Liu

A DEEP LEAMING MODEL WITH MULTI-SCALE SKIP CONNECTIONS FOR SOLAR FLARE

PREDICTION COMBINED WITH PRIOR INFORMATION .................................................................................. 5829 Tian Han ; Qinke Peng ; Yiqing Shen ; Haozhou Li ; Yan Gu

WINDOW-BASED FEATURE EXTRACTION METHOD USING XGBOOST FOR TIME SERIES

CLASSIFICATION OF SOLAR FLARES ................................................................................................................... 5836 Dan McGuire ; Renan Sauteraud ; Vishal Midya

SOLAR FLARE PREDICTION USING TWO-TIER ENSEMBLE WITH DEEP LEARNING AND

GRADIENT BOOSTING MACHINE ........................................................................................................................... 5844 Chau Pham ; Vung Pham ; Tommy Dang

AN APPROACH FOR SCALE SUSPICIOUS NETWORK EVENTS DETECTION .............................................. 5854 Cong Dong ; Yufan Chen ; Yunjian Zhang ; Bo Jiang ; Dongxu Han ; Baoxu Liu

AUTOMATED EVENT PRIORITIZATION FOR SECURITY OPERATION CENTER USING

DEEP LEARNING .......................................................................................................................................................... 5864 Nitika Gupta ; Issa Traore ; Paulo Magella Faria De Quinan

SUSPICIOUS NETWORK EVENT RECOGNITION USING MODIFIED STACKING ENSEMBLE

MACHINE LEARNING ................................................................................................................................................. 5873 Angus F. M. Huang ; Yang Chi-Wei ; Hsiao-Chi Tai ; Yang Chuan ; Jay J. C. Huang ; Yu-Han Liao

IEEE BIGDATA 2019 CUP: SUSPICIOUS NETWORK EVENT RECOGNITION ................................................ 5881 Andrzej Janusz ; Daniel Kaluza ; Agnieszka Chadzynska-Krasowska ; Bartek Konarski ; Joel Holland ; Dominik

Slezak

AN APRIORI-BASED DATA ANALYSIS ON SUSPICIOUS NETWORK EVENT RECOGNITION ................. 5888 Zhiwen Jian ; Hiroshi Sakai ; Junzo Watada ; Arunava Roy ; M Hilmi B Hassan

NAIVE TRANSFER LEARNING APPROACHES FOR SUSPICIOUS EVENT PREDICTION ........................... 5897 Chang Lin

APPLICATION OF XGBOOST TO THE CYBER-SECURITY PROBLEM OF DETECTING

SUSPICIOUS NETWORK TRAFFIC EVENTS .......................................................................................................... 5902 Lukasz Podlodowski ; Marek Kozlowski

IEEE BIGDATA 2019 CUP: BINARY CLASSIFICATION VIA TENSOR COMPLETION ................................. 5908 Teresa M. Ranadive ; Christopher J. Coley

SUSPICIOUS NETWORK EVENT RECOGNITION LEVERAGING ON MACHINE LEARNING ................... 5915 Daniele Sartiano ; Giuseppe Attardi ; Luca Deri ; Maurizio Martinelli

GRADIENT BOOSTING DECISION TREES FOR CYBER SECURITY THREATS DETECTION

BASED ON NETWORK EVENTS LOGS .................................................................................................................... 5921 Quang Hieu Vu ; Dymitr Ruta ; Ling Cen

IDENTIFYING TRULY SUSPICIOUS EVENTS AND FALSE ALARMS BASED ON ALERT

GRAPH ............................................................................................................................................................................. 5929 Tian Wang ; Chen Zhang ; Zhigang Lu ; Dan Du ; Yaopeng Han

AN ENSEMBLE APPROACH FOR SUSPICIOUS TRAFFIC DETECTION FROM HIGH RECALL

NETWORK ALERTS ..................................................................................................................................................... 5937 Peilin Wu ; Jinlei Li ; Yan Meng ; Haojin Zhu

MODELS AND FEATURES WITH COVARIATE SHIFT ADAPTATION FOR SUSPICIOUS

NETWORK EVENT RECOGNITION ......................................................................................................................... 5945 Shu-Yi Xie ; Jian Ma ; Yu-Bin Luo ; Lian-Xin Jiang ; Shirly Jin ; Yang Mo ; Jian-Ping Shen

LANGUAGE IDENTIFICATION AND CONTEXT-BASED ANALYSIS OF CODE-SWITCHING

BEHAVIORS IN SOCIAL MEDIA DISCUSSIONS .................................................................................................... 5951 Akankshya Mishra ; Yashvardhan Sharma

ANUJ@IEEE BIGDATA 2019: A NOVEL CODE-SWITCHING BEHAVIOR ANALYSIS IN

SOCIAL MEDIA DISCUSSIONS NATURAL LANGUAGE PROCESSING ........................................................... 5957 Anuj Saini

TIME SERIES CLASSIFICATION: LESSONS LEARNED IN THE (LITERAL) FIELD WHILE

STUDYING CHICKEN BEHAVIOR ............................................................................................................................ 5962 Alireza Abdoli ; Amy C. Murillo ; Alec C. Gerry ; Eamonn J. Keogh

HIGH PERFORMANCE I/O FOR LARGE SCALE DEEP LEARNING ................................................................. 5965 Alex Aizman ; Gavin Maltby ; Thomas Breuel

SIGN PREDICTION IN ONLINE GAMES .................................................................................................................. 5968 Essa Alhazmi ; Nazim Choudhury

EARLY DETECTION OF ALZHEIMER’S DISEASE USING PATIENT

NEUROPSYCHOLOGICAL AND COGNITIVE DATA AND MACHINE LEARNING

TECHNIQUES ................................................................................................................................................................. 5971 Ibrahim Almubark ; Lin-Ching Chang ; Thanh Nguyen ; Raymond Scott Turner ; Xiong Jiang

THERMAL IMAGERY BASED INSTANCE SEGMENTATION FOR ENERGY AUDIT

APPLICATIONS IN BUILDINGS ................................................................................................................................. 5974 Youness Arjoune ; Sai Peri ; Niroop Sugunaraj ; Debanjan Sadhukhan ; Michael Nord ; Gautham Krishnamoorthy

; David Flynn ; Prakash Ranganathan

QUALIBD: A TOOL FOR MODELLING QUALITY REQUIREMENTS FOR BIG DATA

APPLICATIONS ............................................................................................................................................................. 5977 Darlan Arruda ; Nazim H. Madhavji

A MACHINE LEARNING APPROACH FOR PREDICTION OF LENGTH OF STAY FOR THE

KID’S INPATIENT DATABASE ................................................................................................................................... 5980 Shilpa Balan ; Tanvi Gawade ; Aakanksha Tasgaonkar

SHARP FREQUENCY BOUNDS FOR SAMPLE-BASED QUERIES ...................................................................... 5983 Eric Bax ; John Donald

DATA REPLICATION FOR REDUCING COMPUTING TIME IN DISTRIBUTED SYSTEMS

WITH STRAGGLERS .................................................................................................................................................... 5986 Amir Behrouzi-Far ; Emina Soljanin

SCHEDULING IN THE PRESENCE OF DATA INTENSIVE COMPUTE JOBS .................................................. 5989 Amir Behrouzi-Far ; Emina Soljanin

COMBINING TEXT AND IMAGE DATA FOR PRODUCT RECOMMENDABILITY MODELING ................. 5992 Mark Capelo ; Karan Aggarwal ; Pranjul Yadav

A METHODOLOGY FOR REAL-TIME DATA VERIFICATION EXPLOITING DEEP

LEARNING AND MODEL CHECKING ..................................................................................................................... 5995 Giovanni Capobianco ; Umberto Di Giacomo ; Tommaso Di Tusa ; Francesco Mercaldo ; Antonella Santone

A MODEL CHECKING BASED PROPOSAL FOR MOBILE COLLUDING ATTACK

DETECTION ................................................................................................................................................................... 5998 Rosangela Casolare ; Fabio Martinelli ; Francesco Mercaldo ; Antonella Santone

AN AUTHENTICATION SYSTEM USING NEUROLOGICAL RESPONSES TO MUSIC .................................. 6001 Joseph M Cauthen ; Tejas Gandre ; Marco A. Mercado Espinoza ; Meetkumar J Patel ; Mohammad I Husain

FEDERATED WIRELESS NETWORK INTRUSION DETECTION ....................................................................... 6004 Burak Cetin ; Alina Lazar ; Jinoh Kim ; Alex Sim ; Kesheng Wu

APPLYING SDN BASED DATA NETWORK ON HPC BIG DATA COMPUTING – DESIGN,

IMPLEMENTATION, AND EVALUATION ............................................................................................................... 6007 Hsing-Bung Chen ; Zhi Qiao ; Song Fu

DESIGNING NOVEL FUNCTIONAL PEPTIDES BY MANIPULATING A TEMPERATURE IN

THE SOFTMAX FUNCTION COUPLED WITH VARIATIONAL AUTOENCODER .......................................... 6010 Shuan Chen ; Hyun Uk Kim

IOT MALWARE DYNAMIC ANALYSIS PROFILING SYSTEM AND FAMILY BEHAVIOR

ANALYSIS ....................................................................................................................................................................... 6013 Cheng-Yu Chen ; Shun-Wen Hsiao

EXTRACTING GRAMMATICAL ERROR CORRECTIONS FROM WIKIPEDIA REVISION

HISTORY ......................................................................................................................................................................... 6016 Jhih-Jie Chen ; Yi-Dong Wu ; Yu-Chuan Tai ; Ching-Yu Yang ; Hai-Lun Tu ; Jason S. Chang

EASYRAIN: A USER-FRIENDLY PLATFORM FOR COMPARING PRECIPITATION

NOWCASTING MODELS ............................................................................................................................................. 6019 Ji Cheng ; Guimu Guo ; Da Yan ; Xiaotian Hao ; Wilfred Ng

PARTS CATALOG OBJECT RECOGNITION TECHNOLOGY FOR EFFICIENT DRAWING

DISTRIBUTION MANAGEMENT ............................................................................................................................... 6022 Sung-Wook Choi ; Chang-Seob Song ; Chae-Soo Kim

HAPPY TOILET: A SOCIAL ANALYTICS APPROACH TO THE STUDY OF PUBLIC TOILET

CLEANLINESS ............................................................................................................................................................... 6024 Eugene W. J. Choy ; Winston M. K. Ho ; Xiaohang Li ; Ragini Verma ; Li J. Sim ; Kyong Jin Shim

LISTEN, NUDGE, EMPOWER: A MOBILE GRATITUDE JOURNAL APPLICATION ..................................... 6027 Eugene W. J. Choy ; Gladys H. L. Ng ; Martius J. H. Lim ; Kyong Jin Shim

A PROGRAMMING FRAMEWORK FOR HETEROGENEOUS STREAM ANALYTICS .................................. 6030 Roshan Bharath Das ; Marc X. Makkes ; Alexandru Uta ; Lin Wang ; Henri Bal

BIG FEDERAL DATA CENTERS IMPLEMENTING FAIR DATA PRINCIPLES: ARM DATA

CENTER EXAMPLE ...................................................................................................................................................... 6033 Ranjeet Devarakonda ; Giri Prakash ; Kavya Guntupally ; Jitendra Kumar

PRESERVING COMPOSITION AND CRYSTAL STRUCTURES OF CHEMICAL COMPOUNDS

IN ATOMIC EMBEDDING ........................................................................................................................................... 6037 Yifan Ding ; Daheng Wang ; Tim Weninger ; Meng Jiang

WEB SCRAPING: STATE-OF-THE-ART AND AREAS OF APPLICATION ........................................................ 6040 Rabiyatou Diouf ; Edouard Ngor Sarr ; Ousmane Sall ; Babiga Birregah ; Mamadou Bousso ; Sény Ndiaye Mbaye

TARGOAT: IMPROVING DATASET UPLOAD TIME TO OBJECT STORAGE USING CLIENT-

SERVER COOPERATION ............................................................................................................................................ 6043 Timothy John Ebido ; Kichul Park ; Kyungho Jeon

PSIDB: A FRAMEWORK FOR BATCHED QUERY PROCESSING AND OPTIMIZATION ............................. 6046 Mehrad Eslami ; Yicheng Tu ; Hadi Charkhgard ; Zichen Xu ; Jiacheng Liu

AN OPTIMIZED SCHEDULING PROCESS FOR A LARGE CROWD TO PERFORM SPATIO-

TEMPORAL MOVEMENTS SAFELY DURING PILGRIMAGE ............................................................................ 6049 Emad Felemban ; Abdulaziz Fatani ; Faizan Ur Rehman

STATISTICAL CORRECTION OF AVERAGE CUSTOMER RATINGS FOR PRODUCT

RANKING ........................................................................................................................................................................ 6052 Byron J. Gao ; Frank Medjo

CORANK: SIMULTANEOUSLY RANKING PUBLICATION VENUES AND RESEARCHERS ........................ 6055 Byron J. Gao ; Gayathri Karupakula Jagadeesh Kumar

ON UNDERSTANDING BIOSONAR DEFORMATIONS USING DEEP LEARNING-BASED

VIDEO INTERPOLATION............................................................................................................................................ 6058 Li Gao ; Weikai He

ON UNDERSTANDING DEGRADATION KINETICS OF PHARMACEUTIC GELATIN

MATRICES FOR PRECISION MEDICINE: A DEEP LEARNING APPROACH .................................................. 6060 Li Gao ; Hongjie Jiang ; Kaiming Fu ; Weikai He

HARDENING DGA CLASSIFIERS UTILIZING IVAP ............................................................................................. 6063 Charles Grumer ; Jonathan Peck ; Femi Olumofin ; Anderson Nascimento ; Martine De Cock

REALISTIC TRANSPORT SIMULATION WITH OPEN DATA ............................................................................. 6066 Guimu Guo ; Jalal Majed Khalil ; Da Yan ; Virginia Sisiopiku

A STATISTICAL CAUSAL INFERENCE METHOD FOR EXPLORING ULTRASONICS AND

TOPOLOGICAL DEFORMATIONS IN BIOLOGICAL SYSTEMS ........................................................................ 6069 Weikai He ; Li Gao

COMPARING IMPLEMENTATION VARIANTS OF DISTRIBUTED SPATIAL JOIN ON SPARK ................. 6071 Georg Heiler ; Allan Hanbury

INTERDEPENDENCY BETWEEN THE STOCK MARKET AND FINANCIAL NEWS ...................................... 6074 Eunjeong Hwang ; Yong-Hyuk Kim

CLUSTERED HIERARCHICAL ENTROPY-SCALING SEARCH OF ASTRONOMICAL AND

BIOLOGICAL DATA ..................................................................................................................................................... 6077 Najib Ishaq ; George Student ; Noah M. Daniels

ACCELERATING AN ON-LINE APPROXIMATION MINING FOR LARGE CLOSED ITEMSETS ............... 6078 Koji Iwanuma ; Takumi Nishina ; Yoshitaka Yamamoto

VECTOR-BASED CHURN PREDICTION USING NEURAL NETWORKS IN MOBILE GAMES..................... 6081 Kihoon Jang ; Junwhan Kim ; Byunggu Yu

DETECTION OF HIJACKED AUTHORITATIVE DNS SERVERS BY NAME RESOLUTION

TRAFFIC CLASSIFICATION ...................................................................................................................................... 6084 Yong Jin ; Masahiko Tomoishi ; Satoshi Matsuura

CITATION-BASED AUTHOR CONTRIBUTION MEASURE FOR BYLINE-INDEPENDENCY ....................... 6086 Sukhwan Jung ; Wan Chul Yoon

ESTABLISHING THE SAFETY OF A SMART HEART HEALTH MONITORING SERVICE

THROUGH VALIDATION ............................................................................................................................................ 6089 Murtadha Kareem ; Oliver Faust

BORDER GATEWAY PROTOCOL ANOMALY DETECTION USING NEURAL NETWORK ......................... 6092 Mohsen Karimi ; Ali Jahanshahi ; Abbas Mazloumi ; Hadi Zamani Sabzi

TOWARDS A MULTI-ENGINE QUERY OPTIMIZER FOR COMPLEX SQL QUERIES ON BIG

DATA ................................................................................................................................................................................ 6095 Evdokia Kassela ; Ioannis Konstantinou ; Nectarios Koziris

BIGOPTIBASE: BIG DATA ANALYTICS FOR BASE STATION ENERGY CONSUMPTION

OPTIMIZATION............................................................................................................................................................. 6098 Evdokia Kassela ; Nikodimos Provatas ; Asterios Tsiourvas ; Ioannis Konstantinou ; Nectarios Koziris

A STUDY ON SOLUTION ORIENTED SMART FACTORY DIAGNOSTIC SYSTEM FOR SME ..................... 6101 Seung-Beom Son ; Jun-Yeong Kwon ; Chae-Soo Kim

APPLICATION OF BIG DATA ANALYTICS TO SUPPORT POWER NETWORKS AND THEIR

TRANSITION TOWARDS SMART GRIDS ................................................................................................................ 6104 Sylvie Koziel ; Patrik Hilber ; Ryutaro Ichise

RECURRENT PATH INDEX FOR EFFICIENT GRAPH TRAVERSAL ................................................................ 6107 Kazuma Kusu ; Kenji Hatano

MODEL MANAGEMENT AND HANDWRITTEN CHARACTER RECOGNITION: AN

ENTERPRISE SOLUTION ............................................................................................................................................ 6110 Choudur Lakshminarayan ; Thiagarajan Ramakrishnan ; Awny Al-Omari ; Khaled Bouaziz ; Faraz Ahmad ; Sri

Raghavan ; Prama Agarwal

AI-BASED ONLINE P2P LENDING RISK ASSESSMENT ON SOCIAL NETWORK DATA WITH

MISSING VALUE ........................................................................................................................................................... 6113 Lok Ting Lam ; Shun-Wen Hsiao

ON OPTIMIZATION OF STOCK MARKET PREDICTION METHODS .............................................................. 6116 Warren Landis ; Sangwhan Cha ; Majid Shaalan

HOW DOES FAKE NEWS SPREAD: RAISING AWARENESS & EDUCATING THE PUBLIC

WITH A SIMULATION TOOL ..................................................................................................................................... 6119 Cheng L. Lee ; Joel-David J. J. Wong ; Zi Y. Lim ; Belinda S. T. Tho ; Sean S. W. Kwek ; Kyong Jin Shim

DEVELOPMENT OF A CONTINUOUS COMPLEX EVENT PROCESSING PLATFORM FOR

REAL-TIME TACTICAL MOVING OBJECTS ......................................................................................................... 6122 Jiwan Lee ; Bonghee Hong ; Chumsoo Kim ; Woo Chan Kim

UTILIZING MULTIVARIATE TIME SERIES FOR SEMANTIC SEGMENTATION ......................................... 6125 Frederique Van Leeuwen

MUSIC SEQUENCE PREDICTION WITH MIXTURE HIDDEN MARKOV MODELS ...................................... 6128 Tao Li ; Minsoo Choi ; Kaiming Fu ; Lei Lin

A JOINT MODEL OF CLINICAL DOMAIN CLASSIFICATION AND SLOT FILLING BASED ON

RCNN AND BIGRU-CRF ............................................................................................................................................... 6133 Yuming Li ; Pin Ni ; Junkun Peng ; Jiayi Zhu ; Zhenjin Dai ; Gangmin Li ; Xuming Bai

LEARNING A FACETED CUSTOMER SEGMENTATION FOR DISCOVERING NEW BUSINESS

OPPORTUNITIES AT INTEL ....................................................................................................................................... 6136 Itay Lieder ; Meirav Segal ; Eran Avidan ; Asaf Cohen ; Tom Hope

A HIERARCHICAL CLASSIFICATION MODEL FOR SOLAR FLARE PREDICTION .................................... 6139 Cenru Liu

A MODERN APPROACH TO BIG PROVENANCE .................................................................................................. 6142 Crisrael Lucero ; Phuong H. Nguyen

FEDERATED QUERY PROCESSING FOR BIG DATA IN DATA SCIENCE ....................................................... 6145 Manoj Muniswamaiah ; Tilak Agerwala ; Charles C. Tappert

REMAINING USEFUL LIFE PREDICTION OF INDUSTRIAL CONSUMABLES USING

WIDEBAND VIBRATION SIGNALS ........................................................................................................................... 6148 Daisuke Maeda ; Sudhanshu Gaur

ENABLING FASTER CONVERGENCE IN DISTRIBUTED IRREGULAR GRAPH PROCESSING ................. 6151 Abbas Mazloumi ; Rajiv Gupta

WIKIPEDIA POLARIZATION AND ITS EFFECTS ON NAVIGATION PATHS ................................................. 6154 Cristina Menghini ; Aris Anagnostopoulos ; Eli Upfal

IOT APPLICATION DEVELOPMENT USING MIT APP INVENTOR TO COLLECT AND

ANALYZE SENSOR DATA ........................................................................................................................................... 6157 Thilanka Munasinghe ; Evan W. Patton ; Oshani Seneviratne

SPECTRAL ANALYSIS OF USER INTERESTS FOR EXPERIMENTAL VERIFICATION OF THE

OSCILLATION MODEL FOR OSNS ........................................................................................................................... 6160 Kouichi Nagatani ; Chisa Takano ; Masaki Aida

IDENTIFYING THE RELATIONSHIP BETWEEN PRECIPITATION AND ZIKA OUTBREAKS

IN ARGENTINA ............................................................................................................................................................. 6163 Lilian Ngweta ; Karan Bhanot ; Ariane Maharaj ; Ian Bogle ; Thilanka Munasinghe

DISEASE DIAGNOSIS PREDICTION OF EMR BASED ON BIGRU-ATT-CAPSNETWORK

MODEL ............................................................................................................................................................................ 6166 Pin Ni ; Yuming Li ; Jiayi Zhu ; Junkun Peng ; Zhenjin Dai ; Gangmin Li ; Xuming Bai

IDENTIFYING IMPLICIT REQUIREMENTS IN SRS BIG DATA ......................................................................... 6169 Emebo Onyeka ; Vaibhav Anu ; Aparna S. Varde

EXTRACTING FEATURE ENGINEERING KNOWLEDGE FROM DATA SCIENCE

NOTEBOOKS .................................................................................................................................................................. 6172 Masafumi Oyamada

Q95-SQUARED<E,R,D>, A TUPLE OF SINGULAR METRICS TO DEFINE HISTOGRAM

QUALITY FOR A DATABASE SYSTEM .................................................................................................................... 6174 Parag Paul ; Manas Sharma

IDENTIFYING AND UNDERSTANDING BUSINESS TRENDS USING TOPIC MODELS WITH

WORD EMBEDDING .................................................................................................................................................... 6177 Yun Ning Pek ; Kwan Hui Lim

AUTOMATIC GENERATION OF ELECTRONIC MEDICAL RECORD BASED ON GPT2

MODEL ............................................................................................................................................................................ 6180 Junkun Peng ; Pin Ni ; Jiayi Zhu ; Zhenjin Dai ; Yuming Li ; Gangmin Li ; Xuming Bai

ON NEURAL NETWORK ACTIVATION FUNCTIONS AND OPTIMIZERS IN RELATION TO

POLYNOMIAL REGRESSION .................................................................................................................................... 6183 John Pomerat ; Aviv Segev ; Rituparna Datta

AN IOT-DRIVEN SMART CAFE SOLUTION FOR HUMAN TRAFFIC MANAGEMENT ................................ 6186 Maruthi Prithivirajan ; Kyong Jin Shim

TOWARDS FASTER DISTRIBUTED DEEP LEARNING USING DATA HASHING TECHNIQUES ............... 6189 Nikodimos Provatas ; Ioannis Konstantinou ; Nectarios Koziris

TRACKING POLITICAL EVENTS IN SOCIAL MEDIA: A CASE STUDY OF HONG KONG

PROTESTS ...................................................................................................................................................................... 6192 Haodi Qi ; Hanyu Jiang ; Wende Bu ; Chengzi Zhang ; Kyong Jin Shim

A BIG-DATA APPROACH TO DEFINING BREATHING SIGNATURES FOR IDENTIFYING

RESPIRATORY DISEASE ............................................................................................................................................ 6195 Abrar Rahman ; Yonathan Weiner ; Hailey Swanson ; Rebecca Slepian ; Anusheh Abdullah ; Marvin J. Slepian

SMART EHR - A BIG-DATA APPROACH TO AUTOMATED COLLECTION AND PROCESSING

OF MULTI-MODAL HEALTH SIGNALS IN A DOCTOR-PATIENT ENCOUNTER .......................................... 6198 Abrar Rahman ; Ari Mitra ; Fuad Rahman ; Marvin J. Slepian

EXTRACTING VALID INDOOR SEMANTIC TRAJECTORIES USING MOVEMENT

CONSTRAINTS .............................................................................................................................................................. 6201 Hani Ramadhan ; Yoga Yustiawan ; Joonho Kwon

ON NON-INTRUSIVE PREDICTION OF ACTIVITIES AND BEHAVIOR ........................................................... 6203 Guillermo Ramirez-Prado ; Bashar Barmada ; Veronica Liesaputra

IMPACT OF MANDATED PUBLIC REPORTING IN CALIFORNIA ON 30-DAY READMISSION

FOLLOWING CABG SURGERY: A HEALTH POLICY ANALYSIS ..................................................................... 6205 Monika Ray ; Banafsheh Sadeghi ; Dominique Ritley ; Patrick S. Romano

THE LIMITS TO THE EFFICIENCY OF MACHINE LEARNING ......................................................................... 6208 Kush Arya ; Teja Reddy ; Zachary Dodds

AUTOMATIC CATEGORIZATION OF PRESS ARTICLES THROUGH LEARNING: THE CASE

OF SENEGALESE ONLINE PRESS ............................................................................................................................ 6211 Edouard Ngor Sarr ; Ousmane Sall ; Mamadou Bousso ; Rabiyatou Diouf ; Babiga Birregah ; Sény Ndiaye Mbaye

PLUGIN TO A HEALTHIER LIFE: A WEB BROWSER PLUGIN FOR MENTAL HEALTH

MONITORING ................................................................................................................................................................ 6214 Jane H. K. Seah ; Kyong Jin Shim

AIRLINE MILES REDEMPTION ................................................................................................................................ 6217 Joseph Sebastian ; Thilanka Munasinghe

BIG DATA ANALYSIS ON SOCIAL NETWORKING .............................................................................................. 6220 Zhengwu Sun ; Dan Chia-Tien Lo ; Yong Shi

MITIGATING DDOS WITH POW AND GAME THEORY ...................................................................................... 6223 Kun-Yuan Sung ; Shun-Wen Hsiao

THE INTERNET OF RESPONSIBILITIES - CONNECTING HUMAN RESPONSIBILITIES

USING BIG DATA AND BLOCKCHAIN .................................................................................................................... 6226 Xuejiao Tang ; Jiong Qiu ; Wenbin Zhang ; Ibrahim Toure ; Mingli Zhang ; Enza Messina ; Xueping Xie ; Xuebing

Wang ; Sheng Yu

USING DEEP CONVOLUTIONAL NEURAL NETWORK FOR MOUSE BRAIN

SEGMENTATION IN DT-MRI ..................................................................................................................................... 6229 Anh Thai ; Vy Bui ; Laura Reyes ; Lin-Ching Chang

USER AUTHENTICATION THROUGH KEYSTROKE DYNAMICS BY MEANS OF MODEL

CHECKING: A PROPOSAL ......................................................................................................................................... 6232 Fabio Di Tommaso ; Michele Guerra ; Fabio Martinelli ; Francesco Mercaldo ; Massimo Piedimonte ; Giovanni

Rosa ; Antonella Santone

WORD SENSE DISAMBIGUATION USING WIKIPEDIA LINK GRAPH ............................................................ 6235 Hai-Lun Tu ; Pei-Chen Ho ; Jason S. Chang ; Li-Guang Chen

ROBUST FEATURES FOR ACTIVITIES RECOGNITION ..................................................................................... 6237 Kashyap Venkatesh ; Bashar Barmada ; Veronica Liesaputra ; Guillermo Ramirez-Prado

LARGE SCALE TIME SERIES ANALYSIS FOR INFRASTRUCTURE RELIABILITY ..................................... 6240 Zhichao Wang ; Shashank Singh ; Arnold Pereira

DATA-DRIVEN CONTROL OF A CLASS OF DISCRETE-TIME LINEAR COMPLEX

DYNAMICAL NETWORKS .......................................................................................................................................... 6243 Wei Wang ; Changyang Feng ; Wei Quan

MAP ADJUSTMENT AS A BASE FOR PRIVACY ASSURANCE IN SEMANTIC SPATIAL

TRAJECTORIES RELEASE ......................................................................................................................................... 6246 Xingang Wang ; Renshi Yu ; Tailian Liu ; Wenqing Li ; Xiaoling Sun

CLASSIFYING GENUINE FACE IMAGES FROM DISGUISED FACE IMAGES ............................................... 6248 Junyaup Kim ; Siho Han ; Simon S. Woo

LAUNCHING A USER-GENERATED CONTENT CAMPAIGN TO PROMOTE PUMA’S SOCIAL

CONTRIBUTIONS ......................................................................................................................................................... 6251 Lili Xiang ; Alice Mello ; Ryan Ackerman

A GRAPH CONSTRUCTION STUDY FOR GRAPH-BASED SEMI-SUPERVISED LEARNING:

CASE STUDY ON UNSTRUCTURED TEXT DATA ................................................................................................. 6254 Sumedh Yadav ; Gautam Kumar ; Shivam Kumar

EFFECTIVENESS OF USABILITY & PERFORMANCE FEATURES FOR WEB CREDIBILITY

EVALUATION ................................................................................................................................................................ 6257 Kenta Yamada ; Hayato Yamana

METHOD AND DATASET MINING IN SCIENTIFIC PAPERS .............................................................................. 6260 Rujing Yao ; Linlin Hou ; Yingchun Ye ; Ji Zhang ; Jian Wu

AN WORD2VEC BASED ON CHINESE MEDICAL KNOWLEDGE ...................................................................... 6263 Jiayi Zhu ; Pin Ni ; Yuming Li ; Junkun Peng ; Zhenjin Dai ; Gangmin Li ; Xuming Bai

Author Index