PredictingScientificDataTransferCharacteristicsWilliamAgnew,MichaelFischer,KyleChard,IanFoster
GlobusAllowingcommunicationandcollaborationforusersaroundtheworld.-Kyle Chard
LongTailedDistributions
EndpointPredictionHeuristics
• History:Themostlikelysource(S)/destination(D)endpointasthemostrecentS/Dendpointusedbyauser
• MarkovChain:AtransitionmatrixoftheobservedprobabilitiesofusingeachendpointasaS/DconditionedonaparticularendpointbeingpreviouslyusedasaS/D
• MostUniqueUsers:ThemostlikelyS/DendpointistheS/Dendpointwiththemostuniqueusers
• Institution: ThemostlikelyS/Dendpointforauseristhemostpopularendpointatthatuser’sinstitution
• EndpointOwnership:Themostlikelyendpointisthemostrecentendpointtheusercreated
EnsembleMethods:UsingRecurrentNeuralNetwork
• Combinemultipleheuristicsintoabetterheuristic
• Simpleexample:outputthetoprecommendationofeachheuristic
Results:TransferAccuracy
Results:UserAccuracy
ColdStart:PredictionsforNewUsers