The 2021 edition of the SIGIR eCom Data Challenge, hosted by Coveo, ran from April 21 to June 12. More than 20 teams from industry and academia participated, and 6 final design papers, in which teams shared their insights and methods, were accepted. The final results were presented on July 15, 2021 during the SIGIR eCom'21 Workshop, which also featured an invited talk from the NVIDIA team and a round-table discussion with several participating teams.
Training data, evaluation scripts, and rules can be found in the official challenge repository; related literature, background information about the challenge, and industry use cases can be found in the challenge paper pre-print.
Contacts
For questions regarding the dataset, please contact Jacopo Tagliabue.
This challenge addresses the growing need for reliable predictions within the boundaries of a single shopping session, since customer intentions can differ from one occasion to the next. In e-commerce, the feedback loop driven by behavioural signals ranges from hours down to a few seconds, and machine learning models need to adapt as fast as possible to the continuously changing nature of the customer journey.
The need for efficient procedures for personalization is even clearer if we consider the e-commerce landscape more broadly: outside of giant digital retailers, the constraints of the problem are stricter, due to smaller user bases and the realization that most users are not frequently returning customers.
We release a new session-based dataset including fine-grained browsing events (detail, add, purchase), enriched by linguistic behavior (queries made by shoppers, with items clicked and items not clicked after the query) and catalog meta-data (image, text, pricing information). On this dataset, we ask participants to showcase innovative solutions for two open problems:
The organizers wish to thank Luca Bigon for his outstanding support in data collection, and Surya Kallumadi, Massimo Quadrana, Dietmar Jannach, and Ajinkya Kale for their valuable feedback on a previous version of this paper. Finally, special thanks to Richard Tessier and Coveo's legal team for believing in this data sharing initiative.
The following system description papers were accepted:
April 21 | |
June 5 | |
June 10 | |
June 11 | |
June 17 | |
June 29 | |
July 7 | |
July 10 | |
July 15 |
Position | Nickname | Score (MRR) | Timestamp (UTC) |
---|---|---|---|
1 | DeepBlueAI | 0.277256856863352 | 2021-06-16 15:00:24.054893 |
2 | NVIDIA Merlin | 0.277150722994993 | 2021-06-17 23:47:39.824805 |
3 | tsotfsk | 0.26171723662781 | 2021-06-17 23:51:05.140458 |
4 | scitator | 0.228414620791257 | 2021-06-15 21:00:07.655583 |
5 | louis | 0.223834529799278 | 2021-06-16 17:04:01.073889 |
6 | Yoshi | 0.214855530913555 | 2021-06-17 23:32:29.575103 |
7 | old | 0.191871401529567 | 2021-06-16 15:56:11.615827 |
8 | busdriver | 0.18341056004716 | 2021-06-16 13:23:46.65911 |
9 | DSWue | 0.139330140115961 | 2021-06-16 21:26:42.031644 |
10 | Beantown | 0.116077675397221 | 2021-06-11 23:44:25.022806 |
11 | eggie5 | 0.0311874109181821 | 2021-06-14 05:09:44.15369 |
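
The leaderboard above ranks submissions by MRR (mean reciprocal rank). As a rough illustration of the metric, and not the official evaluation script (which lives in the challenge repository and may apply a rank cutoff or other conventions), a minimal Python sketch under the textbook definition could look like the following. The function names and the toy data are hypothetical.

```python
from typing import Hashable, Sequence


def reciprocal_rank(ranked: Sequence[Hashable], target: Hashable) -> float:
    """Return 1/rank of `target` in `ranked` (1-indexed), or 0.0 if it is absent."""
    for position, item in enumerate(ranked, start=1):
        if item == target:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(
    predictions: Sequence[Sequence[Hashable]],
    targets: Sequence[Hashable],
) -> float:
    """Average the reciprocal rank over all sessions.

    Assumption: one ranked prediction list and one ground-truth item per session.
    """
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        return 0.0
    total = sum(reciprocal_rank(p, t) for p, t in zip(predictions, targets))
    return total / len(targets)


# Toy example with made-up item identifiers (not challenge data).
toy_predictions = [["a", "b", "c"], ["x", "y"], ["q"]]
toy_targets = ["b", "x", "z"]
toy_mrr = mean_reciprocal_rank(toy_predictions, toy_targets)
```

In the toy example the target items sit at ranks 2 and 1 and are missing once, so the reciprocal ranks are 0.5, 1.0, and 0.0. A session whose target never appears in the ranked list contributes zero, which is one reason absolute MRR values on a large catalog tend to stay well below 1.
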
Position | Nickname | Score (F1) | Timestamp (UTC) |
---|---|---|---|
1 | NVIDIA Merlin | 0.0744066480874766 | 2021-06-17 23:47:39.824805 |
2 | Yoshi | 0.071323143782563 | 2021-06-17 23:32:29.575103 |
3 | DeepBlueAI | 0.0712670036197682 | 2021-06-16 15:00:24.054893 |
4 | louis | 0.0691723442031248 | 2021-06-16 17:04:01.073889 |
5 | tsotfsk | 0.0676956513778051 | 2021-06-17 23:51:05.140458 |
6 | scitator | 0.0622307564654324 | 2021-06-15 21:00:07.655583 |
7 | DSWue | 0.0515263362020548 | 2021-06-17 21:28:58.418355 |
8 | Beantown | 0.0476388656948051 | 2021-06-11 23:44:25.022806 |
9 | old | 0.0475242159500783 | 2021-06-16 15:56:11.615827 |
10 | busdriver | 0.0458035709730356 | 2021-06-16 13:23:46.65911 |
11 | eggie5 | 0.014190389913474 | 2021-06-14 05:09:44.15369 |
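
The leaderboard above reports an F1 score. The page does not state how the score is computed, so the sketch below only illustrates one common reading: a set-based F1 between the items a model predicts for a session and the items the shopper actually interacted with, averaged over sessions. That reading, the function names, and the toy data are all assumptions; the evaluation scripts in the challenge repository remain the reference.

```python
from typing import Hashable, Iterable, Sequence


def set_f1(predicted: Iterable[Hashable], relevant: Iterable[Hashable]) -> float:
    """Harmonic mean of precision and recall between two item sets."""
    predicted_set, relevant_set = set(predicted), set(relevant)
    hits = len(predicted_set & relevant_set)
    if hits == 0:
        # Covers empty inputs and disjoint sets without dividing by zero.
        return 0.0
    precision = hits / len(predicted_set)
    recall = hits / len(relevant_set)
    return 2 * precision * recall / (precision + recall)


def mean_f1(
    predictions: Sequence[Iterable[Hashable]],
    ground_truth: Sequence[Iterable[Hashable]],
) -> float:
    """Average the per-session F1 (assumed aggregation, not confirmed by the source)."""
    if len(predictions) != len(ground_truth):
        raise ValueError("predictions and ground_truth must have the same length")
    if not ground_truth:
        return 0.0
    scores = [set_f1(p, g) for p, g in zip(predictions, ground_truth)]
    return sum(scores) / len(scores)


# Toy example with made-up item identifiers (not challenge data).
toy_score = set_f1(["a", "b", "c", "d"], ["b", "e"])
```

In the toy example one of four predicted items is correct (precision 0.25) and one of two relevant items is recovered (recall 0.5), giving an F1 of one third.
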
Position | Nickname | Score (Weighted Micro-F1) | Timestamp (UTC) |
---|---|---|---|
1 | DeepBlueAI | 3.63439774529335 | 2021-06-13 09:06:15.335559 |
2 | NVIDIA Merlin | 3.6340530847665 | 2021-06-14 23:29:52.034462 |
3 | hakubishin3 | 3.63031595108722 | 2021-06-14 04:18:15.102488 |
4 | Shawn | 3.63006620083747 | 2021-06-11 10:21:15.292887 |
5 | Yoshi | 3.5829108962637 | 2021-06-14 22:50:19.714592 |
6 | busdriver | 3.51563694420912 | 2021-06-12 00:13:41.575177 |
Position | Nickname | Score (MRR) | Timestamp (UTC) |
---|---|---|---|
1 | DeepBlueAI | 0.259567372588436 | 2021-06-10 07:06:38.503671 |
2 | NVIDIA Merlin | 0.257838846924539 | 2021-06-10 14:21:19.516696 |
3 | tsotfsk | 0.254900668665202 | 2021-06-09 15:51:53.606362 |
4 | gspmoreira | 0.246658879747372 | 2021-06-02 04:43:46.826176 |
5 | scitator | 0.227762621529733 | 2021-06-09 17:15:01.304097 |
6 | louis | 0.219813275834822 | 2021-06-10 16:50:29.592666 |
7 | hakubishin3 | 0.21850783896367 | 2021-06-07 13:55:34.853107 |
8 | Yoshi | 0.203271422177873 | 2021-06-10 02:55:04.23637 |
9 | Wanna | 0.174838662853926 | 2021-05-27 02:20:01.378616 |
10 | eggie5 | 0.169853537423224 | 2021-04-28 01:33:58.986311 |
11 | old | 0.168984566055026 | 2021-06-10 12:54:39.244947 |
12 | busdriver | 0.164626909973333 | 2021-06-10 12:50:38.254958 |
13 | ECNU_DM | 0.12317866259579 | 2021-06-06 02:43:27.514704 |
14 | Beantown | 0.115919547270715 | 2021-06-10 23:58:17.605326 |
15 | learner | 0.113309491523042 | 2021-05-01 11:02:27.474228 |
16 | ECNU_Rec | 0.112677144884667 | 2021-05-13 11:40:32.372782 |
17 | KonigsbergGuy | 0.0892720412689585 | 2021-06-07 16:08:09.70735 |
18 | Rick | 0.0470746941901396 | 2021-05-04 18:31:44.117046 |
19 | DSWue | 0.001780158105201 | 2021-05-28 08:52:29.361252 |
20 | Nastya | 0 | 2021-05-19 12:45:44.220495 |
Position | Nickname | Score (F1) | Timestamp (UTC) |
---|---|---|---|
1 | NVIDIA Merlin | 0.0704473890923575 | 2021-06-10 14:21:19.516696 |
2 | Yoshi | 0.0683150961432571 | 2021-06-04 06:18:47.454721 |
3 | louis | 0.0675671489008072 | 2021-05-29 14:12:16.80038 |
4 | gspmoreira | 0.0671580029228143 | 2021-06-02 04:43:46.826176 |
5 | tsotfsk | 0.065942425395436 | 2021-06-10 11:14:27.903234 |
6 | DeepBlueAI | 0.0652533072711102 | 2021-06-10 07:06:38.503671 |
7 | hakubishin3 | 0.0627382093680529 | 2021-06-07 13:55:34.853107 |
8 | scitator | 0.0605172072958347 | 2021-06-09 16:15:12.179543 |
9 | Wanna | 0.0534842389144261 | 2021-05-27 03:36:59.695761 |
10 | ECNU_DM | 0.051821044811731 | 2021-05-26 02:41:42.600018 |
11 | eggie5 | 0.0513360414256127 | 2021-06-04 02:09:42.642162 |
12 | old | 0.0420626146480762 | 2021-06-10 12:54:39.244947 |
13 | Beantown | 0.0413345157161334 | 2021-06-04 23:54:19.308318 |
14 | busdriver | 0.0409778820267947 | 2021-06-10 12:50:38.254958 |
15 | KonigsbergGuy | 0.0243042924848845 | 2021-06-07 16:08:09.70735 |
16 | learner | 0.0126461115078898 | 2021-05-01 11:02:27.474228 |
17 | ECNU_Rec | 0.0125607859842969 | 2021-05-16 13:29:12.389733 |
18 | Rick | 0.00551070580065692 | 2021-05-04 18:31:44.117046 |
19 | DSWue | 0.000578046028940261 | 2021-05-28 08:52:29.361252 |
20 | Nastya | 0 | 2021-05-19 12:45:44.220495 |
Position | Nickname | Score (Weighted Micro-F1) | Timestamp (UTC) |
---|---|---|---|
1 | NVIDIA Merlin | 3.63142202394556 | 2021-06-10 22:19:39.439075 |
2 | DeepBlueAI | 3.62898581869512 | 2021-06-10 12:47:23.969359 |
3 | ronai | 3.62223442288064 | 2021-06-07 16:28:00.728037 |
4 | hakubishin3 | 3.62147689588072 | 2021-06-10 12:14:35.016894 |
5 | Shawn | 3.62034182210093 | 2021-06-01 02:30:38.991516 |
6 | SunnySideUp | 3.61663879085421 | 2021-06-03 00:28:11.293968 |
7 | Yoshi | 3.61485959169783 | 2021-06-04 10:33:04.753004 |
8 | busdriver | 3.59379628246656 | 2021-06-09 05:05:34.153941 |