1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
1207
1208
1209
1210
1211
1212
1213
1214
1215
1216
1217
1218
1219
1220
1221
1222
1223
1224
1225
1226
1227
1228
1229
1230
1231
1232
1233
1234
1235
1236
1237
1238
1239
1240
1241
1242
1243
1244
1245
1246
1247
1248
1249
1250
1251
1252
1253
1254
1255
1256
1257
1258
1259
1260
1261
1262
1263
1264
1265
1266
1267
1268
1269
1270
1271
1272
1273
1274
1275
1276
1277
1278
1279
1280
1281
1282
1283
1284
1285
1286
1287
1288
1289
1290
1291
1292
1293
1294
1295
1296
1297
1298
1299
1300
1301
1302
1303
1304
1305
1306
1307
1308
1309
1310
1311
1312
1313
1314
1315
1316
1317
1318
1319
1320
1321
1322
1323
1324
1325
1326
1327
1328
1329
1330
1331
1332
1333
1334
1335
1336
1337
1338
1339
1340
1341
1342
1343
1344
1345
1346
1347
1348
1349
1350
1351
1352
1353
1354
1355
1356
1357
1358
1359
1360
1361
1362
1363
1364
1365
1366
1367
1368
1369
1370
1371
1372
1373
1374
1375
1376
1377
1378
1379
1380
1381
1382
1383
1384
1385
1386
1387
1388
1389
1390
1391
1392
1393
1394
1395
1396
1397
1398
1399
1400
1401
1402
1403
1404
1405
1406
1407
1408
1409
1410
1411
1412
1413
1414
1415
1416
1417
1418
1419
1420
1421
1422
1423
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
1434
1435
1436
1437
1438
1439
1440
1441
1442
1443
1444
1445
1446
1447
1448
1449
1450
1451
1452
1453
1454
1455
1456
1457
1458
1459
1460
1461
1462
1463
1464
1465
1466
1467
1468
1469
1470
1471
1472
1473
1474
1475
1476
1477
1478
1479
1480
1481
1482
1483
1484
1485
1486
1487
1488
1489
1490
1491
1492
1493
1494
1495
1496
1497
1498
1499
1500
1501
1502
1503
1504
1505
1506
1507
1508
1509
1510
1511
1512
1513
1514
1515
1516
1517
1518
1519
1520
1521
1522
1523
1524
1525
1526
1527
1528
1529
1530
1531
1532
1533
1534
1535
1536
1537
1538
1539
1540
1541
1542
1543
1544
1545
1546
1547
1548
1549
1550
1551
1552
1553
1554
1555
1556
1557
1558
1559
1560
1561
1562
1563
1564
1565
1566
1567
1568
1569
1570
1571
1572
1573
1574
1575
1576
1577
1578
1579
1580
1581
1582
1583
1584
1585
1586
1587
1588
1589
1590
1591
1592
1593
1594
1595
1596
1597
1598
1599
1600
1601
1602
1603
1604
1605
1606
1607
1608
1609
1610
1611
1612
1613
1614
1615
1616
1617
1618
1619
1620
1621
1622
1623
1624
1625
1626
1627
1628
1629
1630
1631
1632
1633
1634
1635
1636
1637
1638
1639
1640
1641
1642
1643
1644
1645
1646
1647
1648
1649
1650
1651
1652
1653
1654
1655
1656
1657
1658
1659
1660
1661
1662
1663
1664
1665
1666
1667
1668
1669
1670
1671
1672
1673
1674
1675
1676
1677
1678
1679
1680
1681
1682
1683
1684
1685
1686
1687
1688
1689
1690
1691
1692
1693
1694
1695
1696
1697
1698
1699
1700
1701
1702
1703
1704
1705
1706
1707
1708
1709
1710
1711
1712
1713
1714
1715
1716
1717
1718
1719
1720
1721
1722
1723
1724
1725
1726
1727
1728
1729
1730
1731
1732
1733
1734
1735
1736
1737
1738
1739
1740
1741
1742
1743
1744
1745
1746
1747
1748
1749
1750
1751
1752
1753
1754
1755
1756
1757
1758
1759
1760
1761
1762
1763
1764
1765
1766
1767
1768
1769
1770
1771
1772
1773
1774
1775
1776
1777
1778
1779
1780
1781
1782
1783
1784
1785
1786
1787
1788
1789
1790
1791
1792
1793
1794
1795
1796
1797
1798
1799
1800
1801
1802
1803
1804
1805
1806
1807
1808
1809
1810
1811
1812
1813
1814
1815
1816
1817
1818
1819
1820
1821
1822
1823
1824
1825
1826
1827
1828
1829
1830
1831
1832
1833
1834
1835
1836
1837
1838
1839
1840
1841
1842
1843
1844
1845
1846
1847
1848
1849
1850
1851
1852
1853
1854
1855
1856
1857
1858
1859
1860
1861
1862
1863
1864
1865
1866
1867
1868
1869
1870
1871
1872
1873
1874
1875
1876
1877
1878
1879
1880
1881
1882
1883
1884
1885
1886
1887
1888
1889
1890
1891
1892
1893
1894
1895
1896
1897
1898
1899
1900
1901
1902
1903
1904
1905
1906
1907
1908
1909
1910
1911
1912
1913
1914
1915
1916
1917
1918
1919
1920
1921
1922
1923
1924
1925
1926
1927
1928
1929
1930
1931
1932
1933
1934
1935
1936
1937
1938
1939
1940
1941
1942
1943
1944
1945
1946
1947
1948
1949
1950
1951
1952
1953
1954
1955
1956
1957
1958
1959
1960
1961
1962
1963
1964
1965
1966
1967
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
2027
2028
2029
2030
2031
2032
2033
2034
2035
2036
2037
2038
2039
2040
2041
2042
2043
2044
2045
2046
2047
2048
2049
2050
2051
2052
2053
2054
2055
2056
2057
2058
2059
2060
2061
2062
2063
2064
2065
2066
2067
2068
2069
2070
2071
2072
2073
2074
2075
2076
2077
2078
2079
2080
2081
2082
2083
2084
2085
2086
2087
2088
2089
2090
2091
2092
2093
2094
2095
2096
2097
2098
2099
2100
2101
2102
2103
2104
2105
2106
2107
2108
2109
2110
2111
2112
2113
2114
2115
2116
2117
2118
2119
2120
2121
2122
2123
2124
2125
2126
2127
2128
2129
2130
2131
2132
2133
2134
2135
2136
2137
2138
2139
2140
2141
2142
2143
2144
2145
2146
2147
2148
2149
2150
2151
2152
2153
2154
2155
2156
2157
2158
2159
2160
2161
2162
2163
2164
2165
2166
2167
2168
2169
2170
2171
2172
2173
2174
2175
2176
2177
2178
2179
2180
2181
2182
2183
2184
2185
2186
2187
2188
2189
2190
2191
2192
2193
2194
2195
2196
2197
2198
2199
2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
2215
2216
2217
2218
2219
2220
2221
2222
2223
2224
2225
2226
2227
2228
2229
2230
2231
2232
2233
2234
2235
2236
2237
2238
2239
2240
2241
2242
2243
2244
2245
2246
2247
2248
2249
2250
2251
2252
2253
2254
2255
2256
2257
2258
2259
2260
2261
2262
2263
2264
2265
2266
2267
2268
2269
2270
2271
2272
2273
2274
2275
2276
2277
2278
2279
2280
2281
2282
2283
2284
2285
2286
2287
2288
2289
2290
2291
2292
2293
2294
2295
2296
2297
2298
2299
2300
2301
2302
2303
2304
2305
2306
2307
2308
2309
2310
2311
2312
2313
2314
2315
2316
2317
2318
2319
2320
2321
2322
2323
2324
2325
2326
2327
2328
2329
2330
2331
2332
2333
2334
2335
2336
2337
2338
2339
2340
2341
2342
2343
2344
2345
2346
2347
2348
2349
2350
2351
2352
2353
2354
2355
2356
2357
2358
2359
2360
2361
2362
2363
2364
2365
2366
2367
2368
2369
2370
2371
2372
2373
2374
2375
2376
2377
2378
2379
2380
2381
2382
2383
2384
2385
2386
2387
2388
2389
2390
2391
2392
2393
2394
2395
2396
2397
2398
2399
2400
2401
2402
2403
2404
2405
2406
2407
2408
2409
2410
2411
2412
2413
2414
2415
2416
2417
2418
2419
2420
2421
2422
2423
2424
2425
2426
2427
2428
2429
2430
2431
2432
2433
2434
2435
2436
2437
2438
2439
2440
2441
2442
2443
2444
2445
2446
2447
2448
2449
2450
2451
2452
2453
2454
2455
2456
2457
2458
2459
2460
2461
2462
2463
2464
2465
2466
2467
2468
2469
2470
2471
2472
2473
2474
2475
2476
2477
2478
2479
2480
2481
2482
2483
2484
2485
2486
2487
2488
2489
2490
2491
2492
2493
2494
2495
2496
2497
2498
2499
2500
2501
2502
2503
2504
2505
2506
2507
2508
2509
2510
2511
2512
2513
2514
2515
2516
2517
2518
2519
2520
2521
2522
2523
2524
2525
2526
2527
2528
2529
2530
2531
2532
2533
2534
2535
2536
2537
2538
2539
2540
2541
2542
2543
2544
2545
2546
2547
2548
2549
2550
2551
2552
2553
2554
2555
2556
2557
2558
2559
2560
2561
2562
2563
2564
2565
2566
2567
2568
2569
2570
2571
2572
2573
2574
2575
2576
2577
2578
2579
2580
2581
2582
2583
2584
2585
2586
2587
2588
2589
2590
2591
2592
2593
2594
2595
2596
2597
2598
2599
2600
2601
2602
2603
2604
2605
2606
2607
2608
2609
2610
2611
2612
2613
2614
2615
2616
2617
2618
2619
2620
2621
2622
2623
2624
2625
2626
2627
2628
2629
2630
2631
2632
2633
2634
2635
2636
2637
2638
2639
2640
2641
2642
2643
2644
2645
2646
2647
2648
2649
2650
2651
2652
2653
2654
2655
2656
2657
2658
2659
2660
2661
2662
2663
2664
2665
2666
2667
2668
2669
2670
2671
2672
2673
2674
2675
2676
2677
2678
2679
2680
2681
2682
2683
2684
2685
2686
2687
2688
2689
2690
2691
2692
2693
2694
2695
2696
2697
2698
2699
2700
2701
2702
2703
2704
2705
2706
2707
2708
2709
2710
2711
2712
2713
2714
2715
2716
2717
2718
2719
2720
2721
2722
2723
2724
2725
2726
2727
2728
2729
2730
2731
2732
2733
2734
2735
2736
2737
2738
2739
2740
2741
2742
2743
2744
2745
2746
2747
2748
2749
2750
2751
2752
2753
2754
2755
2756
2757
2758
2759
2760
2761
2762
2763
2764
2765
2766
2767
2768
2769
2770
2771
2772
2773
2774
2775
2776
2777
2778
2779
2780
2781
2782
2783
2784
2785
2786
2787
2788
2789
2790
2791
2792
2793
2794
2795
2796
2797
2798
2799
2800
2801
2802
2803
2804
2805
2806
2807
2808
2809
2810
2811
2812
2813
2814
2815
2816
2817
2818
2819
2820
2821
2822
2823
2824
2825
2826
2827
2828
2829
2830
2831
2832
2833
2834
2835
2836
2837
2838
2839
2840
2841
2842
2843
2844
2845
2846
2847
2848
2849
2850
2851
2852
2853
2854
2855
2856
2857
2858
2859
2860
2861
2862
2863
2864
2865
2866
2867
2868
2869
2870
2871
2872
2873
2874
2875
2876
2877
2878
2879
2880
2881
2882
2883
2884
2885
2886
2887
2888
2889
2890
2891
2892
2893
2894
2895
2896
2897
2898
2899
2900
2901
2902
2903
2904
2905
2906
2907
2908
2909
2910
2911
2912
2913
2914
2915
2916
2917
2918
2919
2920
2921
2922
2923
2924
2925
2926
2927
2928
2929
2930
2931
2932
2933
2934
2935
2936
2937
2938
2939
2940
2941
2942
2943
2944
2945
2946
2947
2948
2949
2950
2951
2952
2953
2954
2955
2956
2957
2958
2959
2960
2961
2962
2963
2964
2965
2966
2967
2968
2969
2970
2971
2972
2973
2974
2975
2976
2977
2978
2979
2980
2981
2982
2983
2984
2985
2986
2987
2988
2989
2990
2991
2992
2993
2994
2995
2996
2997
2998
2999
3000
3001
3002
3003
3004
3005
3006
3007
3008
3009
3010
3011
3012
3013
3014
3015
3016
3017
3018
3019
3020
3021
3022
3023
3024
3025
3026
3027
3028
3029
3030
3031
3032
3033
3034
3035
3036
|
<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE rfc [
<!ENTITY RFC2119 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.2119.xml">
<!ENTITY RFC2782 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.2782.xml">
<!ENTITY RFC3629 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3629.xml">
<!ENTITY RFC3686 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3686.xml">
<!ENTITY RFC3826 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3826.xml">
<!ENTITY RFC3986 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3986.xml">
<!ENTITY RFC4634 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.4634.xml">
<!ENTITY RFC5234 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.5234.xml">
<!ENTITY RFC5869 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.5869.xml">
<!ENTITY RFC5890 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.5890.xml">
<!ENTITY RFC5891 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.5891.xml">
<!ENTITY RFC6781 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.6781.xml">
<!ENTITY RFC6895 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.6895.xml">
<!ENTITY RFC6940 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.6940.xml">
<!ENTITY RFC6979 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.6979.xml">
<!ENTITY RFC7748 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.7748.xml">
<!ENTITY RFC8032 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.8032.xml">
<!ENTITY RFC8126 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.8126.xml">
<!ENTITY RFC8174 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.8174.xml">
<!ENTITY I-D.schanzen-gns PUBLIC '' "http://xml.resource.org/public/rfc/bibxml3/reference.I-D.schanzen-gns.xml">
]>
<?xml-stylesheet type='text/xsl' href='rfc2629.xslt' ?>
<?rfc strict="yes" ?>
<?rfc toc="yes" ?>
<?rfc symrefs="yes"?>
<?rfc sortrefs="yes" ?>
<?rfc compact="yes" ?>
<?rfc subcompact="no" ?>
<rfc xmlns:xi="http://www.w3.org/2001/XInclude" category="info" docName="draft-schanzen-r5n-00" ipr="trust200902" obsoletes="" updates="" submissionType="IETF" xml:lang="en" version="3">
<front>
<title abbrev="The R5N Distributed Hash Table">
The R5N Distributed Hash Table
</title>
<seriesInfo name="Internet-Draft" value="draft-schanzen-r5n-00"/>
<author fullname="Martin Schanzenbach" initials="M." surname="Schanzenbach">
<organization>Fraunhofer AISEC</organization>
<address>
<postal>
<street>Lichtenbergstrasse 11</street>
<city>Garching</city>
<code>85748</code>
<country>DE</country>
</postal>
<email>martin.schanzenbach@aisec.fraunhofer.de</email>
</address>
</author>
<author fullname="Christian Grothoff" initials="C." surname="Grothoff">
<organization>Berner Fachhochschule</organization>
<address>
<postal>
<street>Hoeheweg 80</street>
<city>Biel/Bienne</city>
<code>2501</code>
<country>CH</country>
</postal>
<email>grothoff@gnunet.org</email>
</address>
</author>
<author fullname="Bernd Fix" initials="B." surname="Fix">
<organization>GNUnet e.V.</organization>
<address>
<postal>
<street>Boltzmannstrasse 3</street>
<city>Garching</city>
<code>85748</code>
<country>DE</country>
</postal>
<email>fix@gnunet.org</email>
</address>
</author>
<!-- Meta-data Declarations -->
<area>General</area>
<workgroup>Independent Stream</workgroup>
<keyword>distributed hash tables</keyword>
<abstract>
<t>
This document contains the R<sup>5</sup>N DHT technical specification.
R<sup>5</sup>N is a secure distributed hash table (DHT) routing algorithm
and data structure for decentralized applications.
It features an open peer-to-peer overlay routing mechanism which supports ad-hoc
permissionless participation and support for topologies in restricted-route
environments.
</t>
<t>
This document defines the normative wire format of protocol messages,
routing algorithms, cryptographic routines and security considerations for
use by implementers.
</t>
<t>
This specification was developed outside the IETF and does not have IETF
consensus. It is published here to guide implementation of R<sup>5</sup>N and to
ensure interoperability among implementations including the pre-existing
GNUnet implementation.
</t>
</abstract>
</front>
<middle>
<section anchor="introduction" numbered="true" toc="default">
<name>Introduction</name>
<!--
2022/12/23 MSC: I moved references to rfc6940 to security considerations.
I think we should talk about R5N in the positive here only, not about
RELOAD in the negative.
- Lean. Can be implemented. Not overengineered.
- Path tracking (more difficult) -> Not built in
- Certificates central server ?
- "self-signed certificates can be used in closed networks."
- "Security Framework: A P2P network will often be established among a
set of peers that do not trust each other. RELOAD leverages a
central enrollment server to provide credentials for each peer,
which can then be used to authenticate each operation. This
greatly reduces the possible attack surface." bizarre statement.
- For a PUT, reload requires that
"Each element is signed by a credential which is authorized to
write this Kind at this Resource-ID. If this check fails, the
request <bcp14>MUST</bcp14> be rejected with an Error_Forbidden error."
-->
<!--FIXME: Here we should also cite and discuss RELOAD (https://datatracker.ietf.org/doc/html/rfc6940)
and establish why we need this spec and are not a "Topology plugin"
in RELOAD. The argumentation revolves around the trust model (openness) and
security aspects (path signatures).-->
<t>
This specification describes the protocol of R<sup>5</sup>N.
R<sup>5</sup>N is a Distributed Hash Table (DHT) is an acronym for
"randomized recursive routing for restricted-route
networks" and its first academic description can be found in
<xref target="R5N"/>.
</t>
<t>
DHTs are a key data structure for the construction of decentralized applications.
and they generally provide a robust and efficient means to distribute the
storage and retrieval of key-value pairs.
</t>
<t>
The core idea behind R<sup>5</sup>N is to combine an initial randomized routing
algorithm with an efficient, classical closest-peer algorithm.
This allows us to construct an algorithm that is able to escape and circumvent
restricted route environments while at the same time allow for O(log n) routing
complexity.
</t>
<t>
R<sup>5</sup>N also includes advanced features like tracing paths messages take
through the network, response filters and on-path application-specific data
validation.
</t>
<t>
This document defines the normative wire format of peer-to-peer
messages, routing algorithms, cryptographic routines and security
considerations for use by implementors.
</t>
<section numbered="true" toc="default">
<name>Requirements Notation</name>
<t>
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
"OPTIONAL" in this document are to be interpreted as described in
BCP 14 <xref target="RFC2119"/> <xref target="RFC8174"/> when, and only
when, they appear in all capitals, as shown here.
</t>
</section>
</section>
<section anchor="terminology">
<name>Terminology</name>
<dl>
<dt>Address</dt>
<dd>
<t>
is a UTF-8 <xref target="RFC3629"/> URI
<xref target="RFC3986"/> which can be
used as address to contact a peer.
An example of an addressing scheme used in
this document is "r5n+ip+tcp", which refers to a standard TCP/IP socket
connection. The "hier"-part of the URI must provide a suitable
address for the given addressing scheme.
The following is a non-normative example of address strings:
</t>
<figure title="Example Address URIs.">
<artwork name="" type="" align="left" alt=""><![CDATA[
r5n+ip+udp://1.2.3.4:6789/
gnunet+tcp://12.3.4.5/
]]></artwork>
</figure>
</dd>
<dt>Applications</dt>
<dd>
Applications are components which directly use the DHT overlay
interfaces. Possible applications include the GNU Name System
<xref target="I-D.schanzen-gns"/> and the CADET transport system
<xref target="cadet"/>.
</dd>
<dt>Application API</dt>
<dd>
The application API exposes the core operations of the DHT overlay
to applications.
This includes storing blocks in the DHT and retrieving blocks from the DHT.
</dd>
<dt>Block</dt>
<dd>
Variable-size unit of payload stored in the DHT
under a <tt>Key</tt>. Commonly also called a "value" when talking
about a DHT as a "key-value store".
</dd>
<dt>Block Storage</dt>
<dd>
The <tt>Block Storage</tt> component is used to persist and manage
<tt>Block</tt> data by peers.
It includes logic for enforcing storage quotas, caching strategies and
data validation.
</dd>
<dt>Block-Type</dt>
<dd>
A unique 32-bit value identifying the data format of a <tt>Block</tt>.
Block-Types are either private or registered in the GANA block type registry (see
<xref target="gana_block_type"/>).
</dd>
<dt>Initiator</dt>
<dd>
The peer that initially creates and sends a message (<xref target="p2p_hello"/>,
<xref target="p2p_put"/>, <xref target="p2p_get"/>, <xref target="p2p_result"/>).
</dd>
<dt>HELLO block</dt>
<dd>
A HELLO block is a block with a dedicated block type and is specified in this document.
The HELLO block is used to store and retrieve Peer addresses.
In this document, HELLO blocks are used by the peer discovery mechanism.
</dd>
<dt>HELLO URL</dt>
<dd>
HELLO URLs are URL-formatted HELLO blocks.
They can used for out-of-band exchanges of peer information and are used for
address update signalling messages to neighbours.
</dd>
<dt>Key</dt>
<dd>
512-bit identifier of a location in the DHT. Multiple <tt>Block</tt>s can be
stored under the same key. <tt>Peer Addresses</tt> are valid keys.
</dd>
<dt>Message Processing</dt>
<dd>
The Message Processing component processes requests from and
generates responses to applications and the underlay network.
</dd>
<dt>Neighbor</dt>
<dd>
A neighbor is a peer which is directly able to communicate
with our peer via the <tt>Underlay Interface</tt>.
</dd>
<dt>Peer</dt>
<dd>
A host that is participating in the overlay. Peers are
responsible for holding some portion of the data that has been
stored in the overlay, and they are responsible for routing
messages on behalf of other hosts as needed by the Routing
Algorithm.
</dd>
<dt>Peer Address</dt>
<dd>
The <tt>Peer Address</tt> is the identifier used on the Overlay
to address a peer.
It is a SHA-512 hash of the <tt>Peer ID</tt>.
</dd>
<dt>Peer ID</dt>
<dd>
The <tt>Peer ID</tt> is the public key which is used to authenticate
a peer in the underlay.
The Peer ID is the public key of the corresponding
Ed25519<xref target="ed25519" /> peer private key.
</dd>
<dt>Routing</dt>
<dd>
The Routing component includes the routing table as well as
routing and peer selection logic. It facilitates the R<sup>5</sup>N routing
algorithm with required data structures and algorithms.
</dd>
<dt>Responsible Peer</dt>
<dd>
The peer <tt>N</tt> that is responsible for a specific key <tt>K</tt>, as
defined by the <tt>SelectClosestPeer(K, P)</tt> algorithm (see
<xref target="routing"/>.
</dd>
<dt>Underlay Interface</dt>
<dd>
The Underlay Interface is an abstraction layer on top of the
supported links of a peer. Peers may be linked by a variety of
different transports, including "classical" protocols such as
TCP, UDP and TLS or advanced protocols such as GNUnet, I2P or Tor.
</dd>
</dl>
</section>
<section>
<name>Overview</name>
<t>
In R<sup>5</sup>N peers communicate with each other in order to realize and
maintain two basic operations of a distributed hash table:
</t>
<ul>
<li>
PUT: This operation stores a block payload on one or more peers with
the goal of making the block availiable for queries using the GET operation.
In the classical definition of a dictionary interface, this operation would be
called "insert".
</li>
<li>
GET: This operation queries the network of peers for blocks
previously stored under or near the key.
In the classical definition of a dictionary interface, this operation would be
called "find".
</li>
</ul>
<t>
A peer or its implementation does not necessarily need to expose the above operations
to applications but it commonly will.
For example, the peer could be a server purely used for bootstrapping, routing or
supporting the overlay network with resources.
An example for possible semantics of the above operations provided as an API to applications by an
implementation are outlined in <xref target="overlay"/>.
</t>
<t>
In a trivial scenario where there is only one peer (the local host),
R<sup>5</sup>N operates in a very similar fashion to a dictionary data structure.
However, the default use case is one where nodes communicate directly and
indirectly in order to realize a distributed storage mechanism.
This communication requires a lower-level peer addressing and message transport
mechanism such as TCP/IP.
R<sup>5</sup>N is agnostic to the underlying transport protocol which is why
this document defines a common addressing and messaging interface in
<xref target="underlay"/>.
The interface provided by this underlay is used across the specification of the
R<sup>5</sup>N protocol.
It also serves as a set of requirements of possible transport mechanisms that
can be used to implement R<sup>5</sup>N with.
That being said, common transport protocols such as TCP/IP or UDP/IP and their
interfaces are suitable R<sup>5</sup>N underlays used by existing
implementations.
</t>
<!-- consider moving some of this back into sec considerations -->
<t>
Specifics about the protocols of the underlays providing
connectivity or the applications using the DHT are out of
the scope of this document. However, we note that peers
implementing disjoint sets of underlay protocols may
experience difficulties communicating (unless other peers
bridge the respective underlays). Similarly, peers that
do not support a particular application will not be able
to validate application-specific payloads and may thus be
tricked into storing or forwarding corrupt blocks.
</t>
<t>
In order to establish an initial connection to a network of R<sup>5</sup>N
peers, an initial, addressable bootstrap peer is required.
Further peers, including neighbors, are then learned via a peer discovery
process as defined in <xref target="find_peer"/>.
</t>
<t>
Across this document, the functional components of an R<sup>5</sup>N
implementation are divided into routing (<xref target="routing"/>),
message processing (<xref target="p2p_messages"/>) and
block processing (<xref target="blockstorage"/>).
Applications that require application-specific block payloads are expected to
register a block type in the GANA block type registry (<xref target="gana_block_type"/>)
and provide a specification of the associated block operations (<xref target="block_functions"/>).
to implementors of R<sup>5</sup>N.
<xref target="figure_r5n_arch"/> illustrates the architectural overview of
R<sup>5</sup>N.
</t>
<figure anchor="figure_r5n_arch" title="The R5N architecture.">
<artwork><![CDATA[
| +-----------------+ +-------+
Applications | | GNU Name System | | CADET | ...
| +-----------------+ +-------+
-------------+------------------------------------ Application API
| ^
| | +---------------+
| | | Block Storage |
| | +---------------+
| | ^
R5N | v v
| +--------------------+ +---------+
| | Message Processing |<-->| Routing |
| +--------------------+ +---------+
| ^ ^
| v v
-------------+------------------------------------ Underlay Interface
| +--------+ +--------+
| |GNUnet | |IP | ...
Connectivity | |Underlay| |Underlay|
| |Link | |Link |
| +--------+ +--------+
]]>
</artwork>
</figure>
</section>
<section anchor="underlay" numbered="true" toc="default">
<name>Underlay</name>
<t>
In the network underlay, a peer is addressable by traditional
means out of scope of this document. For example, the peer may
have a TCP/IP address, or a HTTPS endpoint.
While the specific addressing options and mechanisms are out of scope
for this document, it is necessary to define a universal addressing
format in order to facilitate the distribution of connectivity
information to other peers in the DHT overlay.
This format is the "HELLO" Block (described in <xref target="hello_block"/>),
which contains URIs. The scheme of each URI indicates which underlay understands the
respective address given in the rest of the URI.
</t>
<!--
1) The current API is always fire+forget, it doesn't allow for flow
control. I think we need to add that, possibly for sending and receiving.
IDK.
2) I'm not sure what to do with the crypto: mandate EdDSA or allow the
underlay to do whatever public keys it likes.
We need keys in the overlay. (Path signatures). Do they need to
be the same keys???
3) I think we may want to mandate that the lower layer at least
authenticate the other peer (i.e. every UDP message could be in
cleartext, but would need to come with an EdDSA signature, alas 92 byte
overhead and a signature verification _required_). Otherwise, I don't
see how we can offer even the most minimal protections against peer
impersonation attacks. WDYT?
Security considerations? Prerequisites?
-->
<t>
It is expected that the underlay provides basic mechanisms to
manage peer connectivity and addressing.
The required functionalities can be represented by the following
API:
</t>
<dl>
<dt>
<tt>TRY_CONNECT(N, A)</tt>
</dt>
<dd>
A function which allows the local peer to attempt the establishment of
a connection to another peer <tt>N</tt> using an address <tt>A</tt>.
When the connection attempt is successful, information on the new
peer is offered through the <tt>PEER_CONNECTED</tt> signal.
</dd>
<!-- FIXME: Not used anywhere in draft -->
<dt>
<tt>HOLD(P)</tt>
</dt>
<dd>
A function which tells the underlay to keep a hold on to a connection
to a peer <tt>P</tt>. Underlays are usually limited in the number
of active connections. With this function the DHT can indicate to the
underlay which connections should preferably be preserved.
</dd>
<!-- FIXME: Not used anywhere in draft -->
<dt>
<tt>DROP(P)</tt>
</dt>
<dd>
A function which tells the underlay to drop the connection to a
peer <tt>P</tt>. This function is only there for symmetry and
used during the peer's shutdown to release all of the remaining
HOLDs. As R<sup>5</sup>N always prefers the longest-lived
connections, it would never drop an active connection that it
has called HOLD() on before. Nevertheless, underlay implementations
should not rely on this always being true. A call to DROP() also
does not imply that the underlay must close the connection: it merely
removes the preference to preserve the connection that was established
by HOLD().
</dd>
<dt>
<tt>SEND(P, M)</tt>
</dt>
<dd>
A function that allows the local peer to send a protocol message
<tt>M</tt> to a peer <tt>P</tt>.
</dd>
<dt>
<tt>L2NSE = ESTIMATE_NETWORK_SIZE()</tt>
</dt>
<dd>
A procedure that provides an estimate of the network size.
The result, <tt>L2NSE</tt>, must be the base-2 logarithm of the estimated number of peers in the network.
It is used by the routing algorithm.
If the underlay does not support a protocol for network size estimation (such as cite paper NSE) the value
<bcp14>MAY</bcp14> be provided as a configuration parameter to the implementation.
</dd>
</dl>
<t>
The above procedures are meant to be actively
executed by the implementation as part of the peer-to-peer protocol.
In addition, the underlay is expected to emit
the following signals (usually implemented as callbacks)
based on network events observed by the underlay implementation:
</t>
<dl>
<dt>
<tt>PEER_CONNECTED -> P</tt>
</dt>
<dd>
is a signal that allows the DHT to react to a newly connected peer
<tt>P</tt>.
Such an event triggers, for example, updates in the
routing table and gossiping of HELLOs to that peer.
</dd>
<dt>
<tt>PEER_DISCONNECTED -> P</tt>
</dt>
<dd>
is a signal that allows the DHT to react to a recently disconnected
peer.
Such an event triggers, for example, updates in the
routing table.
</dd>
<dt>
<tt>ADDRESS_ADDED -> A</tt>
</dt>
<dd>
The underlay signals indicates that an address <tt>A</tt> was added for our
local peer and that henceforth the peer may be reachable under this address.
This information is used to advertise
connectivity information about the local peer to other peers.
<tt>A</tt> must be a URI suitable for inclusion in a HELLO payload
<xref target="hello_block"/>.
</dd>
<dt>
<tt>ADDRESS_DELETED -> A</tt>
</dt>
<dd>
This underlay signals indicates that an address <tt>A</tt> was removed
from the set of addresses the local peer is possibly reachable
under. Addresses must have been added before they may be deleted.
This information is used to no longer advertise
this address to other peers.
</dd>
<dt>
<tt>RECEIVE -> (P, M)</tt>
</dt>
<dd>
This signal informs the local peer that a protocol
message <tt>M</tt> was received from a peer <tt>P</tt>.
</dd>
</dl>
<t>
These signals then drive updates of the routing table, local storage
and message transmission.
</t>
</section>
<section anchor="routing" numbered="true" toc="default">
<name>Routing</name>
<t>
In order to select peers which are suitable destinations for
routing messages, R<sup>5</sup>N uses a hybrid approach:
Given an estimated network size N, the peer selection for the
first N hops is random. After the initial N hops, peer selection
follows an XOR-based peer distance calculation.
</t>
<t>
To enable routing, any R<sup>5</sup>N implementation must keep
information about its current set of neighbors.
Upon receiving a connection notification from the Underlay through
<tt>PEER_CONNECTED</tt>, information on the new neighbor
<bcp14>MUST</bcp14> be added.
Similarly when a disconnect is indicated by the Underlay through
<tt>PEER_DISCONNECTED</tt> messages for all addresses of the peer it
<bcp14>MUST</bcp14> be removed.
The implementation <bcp14>MAY</bcp14> try to re-establish connection
to the peer by issuing <tt>TRY_CONNECT</tt> requests on the Underlay
using valid HELLO caches.
</t>
<t>
In order to achieve O(log n) routing performance,
the data structure for managing neighbors and their
metadata <bcp14>MUST</bcp14> be implemented using the k-buckets concept of
<xref target="Kademlia"/> as defined in <xref target="routing_table"/>.
Maintenance of the routing table (after bootstrapping) is
described in <xref target="find_peer"/>.
</t>
<t>
Unlike <xref target="Kademlia"/>, routing decisions in
R<sup>5</sup>N are also influenced by a Bloom filter in the message
that prevents routing loops. This data structure is discussed in
<xref target="routing_bloomfilter"/>. <xref target="routing_functions"/>
describes the key functions provided on top of these data structures.
</t>
<section anchor="routing_table">
<name>Routing Table</name>
<t>
The routing table consists of an array of k-buckets. Each
k-bucket contains a list of neighbors.
The i-th k-bucket stores neighbors whose peer IDs are between distance 2^i and 2^(i+1) from the local peer.
System constraints will typically force an implementation to impose some
upper limit on the number of neighbors kept per k-bucket.
</t>
<t>
Initially, the implementation depends upon either the Underlay providing at
least one initial connection to a peer (signalled through
<tt>PEER_CONNECTED</tt>), or the application/end-user providing at
least one working HELLO which is then in turn used to call <tt>TRY_CONNECT</tt>
on the Underlay.
While details on how the first connection is established <bcp14>MAY</bcp14>
depend on the specific implementation, this <bcp14>SHOULD</bcp14> usually be done
by an out-of-band exchange of the information from a HELLO block.
<!-- FIXME: Is this really an encoding of a block? It seems to me that "HELLO"
is not properly defined.
FIXME: Should? Isn't this part of the HelloMessage? -->
For this, section <xref target="hello_url"/> specifies a URL format for encoding HELLO
blocks as text strings which <bcp14>SHOULD</bcp14> be supported by implementations.
</t>
<t>
Implementations <bcp14>SHOULD</bcp14> try to keep at least
5 entries per k-bucket. Embedded systems that cannot manage
this number of connections <bcp14>MAY</bcp14> use connection-level
signalling to indicate that they are merely a client utilizing a
DHT and not able to participate in routing. DHT peers receiving
such connections <bcp14>MUST NOT</bcp14> include connections to
such restricted systems in their k-buckets, thereby effectively
excluding them when making routing decisions.
</t>
<t>
If a system hits constraints with respect to
the number of active connections, an implementation
<bcp14>MUST</bcp14> evict peers from those k-buckets with the
largest number of neighbors. The eviction strategy <bcp14>MUST</bcp14> be
to drop the shortest-lived connections first.
</t>
</section>
<section anchor="find_peer">
<name>Peer Discovery</name>
<t>
To build its routing table, a peer will send out requests
asking for blocks of type HELLO using its own location as the key,
but filtering all of its neighbors via the Bloom filter described
in <xref target="result_filter"/>.
These requests <bcp14>MUST</bcp14> use the FindApproximate and DemultiplexEverywhere
flags. FindApproximate will ensure that other peers will reply
with keys they merely consider close-enough, while DemultiplexEverywhere
will cause each peer on the path to respond, which is likely to yield
HELLOs of peers that are useful somewhere in the routing table.
</t>
<t>
<!-- This here sais that this is a mandatory functionality. in HelloMessage it
sais RECOMMENDED -->
To facilitate peer discovery, each peer <bcp14>MUST</bcp14> broadcast its own
HELLO message to all peers in the routing table periodically.
The specific frequency <bcp14>MAY</bcp14> depend on available bandwidth
but <bcp14>SHOULD</bcp14> be a fraction of the expiration period.
Whenever a peer receives such a HELLO message from another peer that is
already in the routing table, it must cache it as long as that peer is in its routing table
(or until the HELLO expires) and serve it in response to
<!-- FIXME wat is a Peer Discovery request??
maybe a HELLO GET request?
-->
Peer Discovery requests.
Details about the format of the
HELLO message are given in <xref target="p2p_hello_wire"/>.
</t>
<t>
The frequency of advertisement and peer discovery messages
<bcp14>MAY</bcp14> be adapted according to network conditions,
the set of already connected neighbors,
the workload of the system and other factors which are at the discretion of
the developer.
</t>
<t>
Furthermore, the Underlay <bcp14>SHOULD</bcp14> provide the implementation with one or more
addresses signalled through <tt>ADDRESS_ADDED</tt>. Zero addresses <bcp14>MAY</bcp14> be
provided if a peer can only establish outgoing connections and is otherwise unreachable.
<!-- FIXME: What about HelloMessages? What is the distinction between HELLO blocks
and HELLO messages? -->
The implementation periodically advertises all
active addresses in a HELLO block <xref target="hello_block"/>.
</t>
</section>
<section anchor="routing_bloomfilter">
<name>Peer Bloom Filter</name>
<t>
As DHT <tt>GetMessage</tt>s and <tt>PutMessage</tt>s traverse a random path through the network for the
first N hops, it is essential that routing loops are avoided.
In R<sup>5</sup>N, a Bloom filter is part of the routing metadata in
messages in order to prevent circular routes.
The Bloom filter is updates at each hop with the hops
peer identity.
For the next hop selection in both the random and the deterministic
case, any peer which is in the Bloom filter for the respective message
is not included in the peer selection process.
</t>
<t>
Any peer which is forwarding <tt>GetMessage</tt>s or <tt>PutMessage</tt>s
(<xref target="p2p_messages"/>) adds its own peer ID to the
peer Bloom filter.
This allows other peers to (probabilistically) exclude already
traversed peers when searching for the next hops in the routing table.
</t>
<t>
The peer Bloom filter follows the definition in <xref target="bloom_filters"/>
and consists of L=1024 buckets.
The set of elements <tt>E</tt> consists of of all possible 256-bit peer IDs.
The number of buckets per element <tt>e</tt> is <tt>k=16</tt>.
The mapping function <tt>M</tt> is defined as follows:
</t>
<t>
<tt>M(e) -> SHA-512 (e) as uint32[]</tt>
</t>
<t>
The element <tt>e</tt> is hashed using SHA-512.
The resulting byte string is interpreted as a string of k=16
32-bit integers in network byte order which are used to set and check the bucket bits
in <tt>B</tt> using <tt>BF-SET</tt> and <tt>BF-TEST</tt>.
</t>
<t>
We note that the peer Bloom filter may exclude peers due to false-postive
matches. This is acceptable as routing should nevertheless
terminate (with high probability) in close vicinity of the key.
</t>
</section>
<section anchor="routing_functions">
<name>Routing Functions</name>
<t>
Using the data structures described so far,
the R<sup>5</sup>N routing component provides
the following functions for
message processing (<xref target="p2p_messages"/>):
</t>
<dl>
<dt>
<tt>GetDistance(A, B) -> Distance as Integer</tt>
</dt>
<dd>
This function calculates the binary XOR between A and B.
The resulting distance is interpreted as an integer where
the leftmost bit is the most significant bit.
</dd>
<dt>
<tt>SelectClosestpeer(K, B) -> N</tt>
</dt>
<dd>
This function selects the neighbor <tt>N</tt> from our
routing table with the shortest XOR-distance to the key <tt>K</tt>.
This means that for all other peers <tt>N'</tt> in the routing table
<tt>GetDistance(N, K) < GetDistance(N',K)</tt>.
Peers with a positive test against the peer Bloom
filter <tt>B</tt> are not considered.
</dd>
<dt>
<tt>SelectRandompeer(B) -> N</tt>
</dt>
<dd>
This function selects a random peer <tt>N</tt> from
all neighbors.
Peers with a positive test in the peer Bloom
filter <tt>B</tt> are not considered.
</dd>
<dt>
<tt>Selectpeer(K, H, B) -> N</tt>
</dt>
<dd>
This function selects a neighbor <tt>N</tt> depending on the
number of hops <tt>H</tt> parameter.
If <tt>H < NETWORK_SIZE_ESTIMATE</tt>
this function <bcp14>MUST</bcp14> return <tt>SelectRandompeer(B)</tt> and
<tt>SelectClosestpeer(K, B)</tt> otherwise.
</dd>
<dt>
<tt>IsClosestPeer(N, K, B) -> true | false</tt>
</dt>
<dd>
This function checks if <tt>N</tt> is the closest peer for <tt>K</tt>
(cf. <tt>SelectClosestpeer(K)</tt>).
Peers with a positive test in the Bloom filter <tt>B</tt> are not considered.
</dd>
<dt>
<tt>ComputeOutDegree(REPL_LVL, HOPCOUNT, L2NSE) -> Number</tt>
</dt>
<dd>
<t>
This function computes the number of neighbors
that a message should be forwarded to. The arguments
are the desired replication level (<tt>REPL_LVL</tt>), the <tt>HOPCOUNT</tt> of the message so far and
and the current network size estimate (<tt>L2NSE</tt>) as provided
by the underlay.
The result is the non-negative number of next hops to
select. The following figure gives the
pseudocode for computing the number of neighbors
the peer should attempt to forward the message to.
</t>
<figure anchor="compute_out_degree" title="Computing the number of next hops.">
<artwork name="" type="" align="left" alt=""><![CDATA[
function ComputeOutDegree(REPL_LVL, HOPCOUNT, L2NSE)
BEGIN
if (HOPCOUNT > L2NSE * 4)
return 0;
if (HOPCOUNT > L2NSE * 2)
return 1;
if (0 = REPL_LEVL)
REPL_LEVL = 1
if (REPL_LEVEL > 16)
REPL_LEVEL = 16
RM1 = REPL_LEVEL - 1
return 1 + (RM1 / (L2NSE + RM1 * HOPCOUNT))
]]></artwork>
</figure>
<!-- FIXME: WTF? So much wrong here.
1. Why is this not part of code if necessary?
2. Why is this necessary?
3. Why probabillistic?
Implementations are bound to get this wrong. -->
<t>
The above calculation may yield values that are
not discrete. Hence, the result <bcp14>MUST</bcp14> be
rounded probabilistically to the nearest
discrete value, using the fraction
as the probability for rounding up.
</t>
</dd>
</dl>
</section>
<section anchor="pending_table">
<name>Pending Table</name>
<t>
R<sup>5</sup>N performs stateful routing where the messages
only carry the query hash and do not encode the ultimate
source or destination of the request. Routing a request
towards the key is doing hop-by-hop using the routing table and the
query hash. The pending table is used to route responses
back to the originator. In the pending table each peer
primarily associates a query hash with the associated
originator of the request. The pending table <bcp14>MUST</bcp14>
store entries for the last <tt>MAX_RECENT</tt> requests
the peer has encountered. To ensure that the peer does
not run out of memory, information about older requests
is discarded. The value of <tt>MAX_RECENT</tt> <bcp14>MAY</bcp14> be
configurable and <bcp14>SHOULD</bcp14> be at least 128k.
</t>
<t>
For each entry in the pending table, the DHT <bcp14>MUST</bcp14> track
not only the query key and the origin, but also the
extended query, requested block type and flags, and the
result filter. If the query did not provide
a result filter, a fresh result filter
<bcp14>MUST</bcp14> still be created to filter duplicate replies.
Details of how a result filter works depend on the
type, as described in <xref target="block_functions"/>.
</t>
<t>
When a second query from the same origin for the
same query hash is received, the DHT <bcp14>MUST</bcp14>
attempt to merge the new request with the state for
the old request. If this is not possible, the
existing result filter <bcp14>MUST</bcp14> be
discarded and replaced with the result
filter of the incoming message.
</t>
<t>
We note that for local applications, a fixed limit on
the number of concurrent requests may be problematic.
Hence, it is <bcp14>RECOMMENDED</bcp14> that implementations
track requests from local applications separately and
preserve the information until the application explicitly
stops the request.
</t>
</section>
</section>
<section anchor="p2p_messages" numbered="true" toc="default">
<name>Message Processing</name>
<t>
Further, the implementation <bcp14>MAY</bcp14> act as an initiator of
messages.
If instructed through an application-facing API such as the one outlined
in <xref target="overlay"/>, the peer may acts as an initiator of <tt>GetMessage</tt>s
or <tt>PutMessage</tt>s.
The status of initiator is relevant for peers when processing <tt>ResultMessages</tt>
and the potential handover of results to the application.
</t>
<t>
The implementation <bcp14>MUST</bcp14> listen for <tt>RECEIVE(P, M)</tt> signals
from the Underlay and respond to the respective messages sent by
the peer <tt>P</tt>.
</t>
<t>
Wheather initiated locally or received from a neighbour, the implementation
processes the messages according to the wire formats and the required
validations detailed in the following.
Where required, the local peer's ID is referred to as <tt>SELF</tt>.
</t>
<section anchor="message_components">
<name>Message components</name>
<t>
This section describes some data structures and fields shared
by various message types.
</t>
<section anchor="route_flags">
<name>Flags</name>
<t>
Flags is a 16-bit vector representing binary options.
Each flag is represented by a bit in the field starting from 0 as
the rightmost bit to 15 as the leftmost bit.
<!--FIXME: Actually, we set those bits and then store the resulting
value in NBO...-->
</t>
<dl>
<dt>0: DemultiplexEverywhere</dt>
<dd>
This bit indicates that each peer along the way should process the request.
If the bit is not set, intermediate peers only route the message and only
peers which consider themselves closest to the key look for answers
in their local storage for <tt>GetMessage</tt>s and cache the
block in their local storage for <tt>PutMessage</tt>s and <tt>ResultMessage</tt>s.
</dd>
<dt>1: RecordRoute</dt>
<dd>
This bit indicates to keep track of the path that the message takes
in the P2P network.
</dd>
<dt>2: FindApproximate</dt>
<dd>
This bit allows results where the key does not match exactly.
</dd>
<dt>3: Truncated</dt>
<dd>
This is a special flag which is set if a peer truncated the path
and thus the first hop on the path is given without a signature
to enable checking of the next signature. MUST never be set in
a query.
</dd>
<dt>4-15: Reserved</dt>
<dd>
The remaining bits are reserved for future use and
<bcp14>MUST</bcp14> be set to 0 when initiating an operation.
If non-zero bits are received, implementations <bcp14>MUST</bcp14>
preserve these bits when forwarding messages.
</dd>
</dl>
</section>
<section anchor="p2p_pathelement">
<!-- TODO-GROTHOFF: Discuss this change again. The text is currently not correct
it is very difficult to understand. Is it worth 32 byte -->
<name>Path Element</name>
<t>
A Path Element represents a hop in the path a message has taken
through the network.
The wire format of a Path Element is illustrated in
<xref target="figure_pathelement"/>.
</t>
<figure anchor="figure_pathelement" title="The Wire Format of a Path Element.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER ID |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>where:</t>
<dl>
<dt>SIGNATURE</dt>
<dd>
is a 64 byte EdDSA signature using the current hop's private
key affirming the previous and next hops.
</dd>
<dt>PEER ID</dt>
<dd>
is the EdDSA public key of the peer on the path.
</dd>
</dl>
<t>
An ordered list of Path Elements may be appended to any routed
<tt>PutMessage</tt>s or <tt>ResultMessage</tt>s.
The signature of a Path Element is created by the current hop
after it made its routing decision identifiying the successor peer.
</t>
<t>
<xref target="figure_path_ex"/> shows the wire format of an example
path from Peers A over B and C as it would be received by D in the
<tt>PUTPATH</tt> of a <tt>PutMessage</tt> or the combined
<tt>PUTPATH</tt> and <tt>GETPATH</tt> of a <tt>ResultMessage</tt>.
The wire format of the Path Elements allows a natural
extension of the <tt>PUTPATH</tt> along the route of the <tt>ResultMessage</tt>
to the destination forming the <tt>GETPATH</tt>.
The <tt>PutMessage</tt> would indicate in the <tt>PATH_LEN</tt> field
a length of 3.
The <tt>ResultMessage</tt> would indicate a path length of 3 as the
sum of the field values in <tt>PUTPATH_L</tt> and <tt>GETPATH_L</tt>.
</t>
<figure anchor="figure_path_ex" title="Example of a path as found in PutMessages or ResultMessages from A to D.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE A |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER A |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE B |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER B |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE C |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER C |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE D |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>
A path may be truncated in which case the signature of the truncated
Path Element is omitted leaving only the Peer ID required for the
verification of the subsequent Path Element signature.
Such a truncated path is indicated with the respective flag (<xref target="route_flags"/>).
The Peer ID of the last Path Element is omitted as it must be that of
the sender of the PutMesssage or ResultMessage.
The wire format of a truncated example path from Peers B over C to D
is illustrated in <xref target="figure_path_ex_trunc"/>.
The wire format of an example path from Peers B over C as it
would be received by D in a <tt>PutMessage</tt> or <tt>ResultMessage</tt>
is illustrated in <xref target="figure_path_ex_trunc"/>.
A <tt>ResultMessage</tt> would indicate in the <tt>PATH_LEN</tt> field
a length of 1.
A <tt>PutMessage</tt> would indicate a length of 1 as the sum of
<tt>PUTPATH_L</tt> and <tt>GETPATH_L</tt> fields.
</t>
<figure anchor="figure_path_ex_trunc" title="Example of a truncated path from Peer B to Peer D.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER B |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE C |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER C |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE D |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>
The SIGNATURE field in a Path Element covers a 64-bit contextualization header, the
the block expiration, a hash of the block
payload, as well as the predecessor peer ID and the peer ID of the
successor that the peer making the signature is routing the
message to. Thus, the signature made by SELF basically says that
SELF received the block payload from PEER PREDECESSOR and has forwarded
it to PEER SUCCESSOR. The wire format is illustrated
in <xref target="figure_pathelewithpseudo"/>.
</t>
<figure anchor="figure_pathelewithpseudo" title="The Wire Format of the Path Element for Signing.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIZE | PURPOSE |
+-----+-----+-----+-----+-----+-----+-----+-----+
| EXPIRATION |
+-----+-----+-----+-----+-----+-----+-----+-----+
| BLOCK HASH |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER PREDECESSOR |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER SUCCESSOR |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<dl>
<dt>SIZE</dt>
<dd>
A 32-bit value containing the length of the signed data in bytes
in network byte order.
The length of the signed data <bcp14>MUST</bcp14> be 144 bytes.
</dd>
<dt>PURPOSE</dt>
<dd>
A 32-bit signature purpose flag. This field <bcp14>MUST</bcp14> be 6 (in network
byte order).
</dd>
<dt>EXPIRATION</dt>
<dd>
denotes the absolute 64-bit expiration date of the block.
In microseconds since midnight (0 hour), January 1, 1970 UTC in
network byte order.
</dd>
<dt>BLOCK HASH</dt>
<dd>
a SHA-512 hash over the block payload.
</dd>
<dt>PEER PREDECESSOR</dt>
<dd>
the Peer ID of the previous hop. If the signing peer initiated
the PUT, this field is set to all zeroes.
</dd>
<dt>PEER SUCCESSOR</dt>
<dd>
the Peer ID of the next hop (not of the signer).
</dd>
</dl>
</section>
</section>
<section anchor="p2p_hello" numbered="true" toc="default">
<name>HelloMessage</name>
<t>
<!-- FIXME: While it is discussed here how to hangle HelloMessages
nowhere in the draft is a HelloMessage created at the
initiator so it is completely irrelevant atm -->
When the Underlay notifies the implementation of added or removed
addresses through <tt>ADDRESS_ADDED</tt> and <tt>ADDRESS_DELETED</tt>
it <bcp14>MAY</bcp14> disseminate those changes to neighbors using
<tt>HelloMessage</tt>s.
<!-- FIXME yesyes this is blabla and obvious when reading the processing section.
In contrast to a HELLO block, a
<tt>HelloMessage</tt> does not contain the ID of
the peer the address information is about: in a
<tt>HelloMessage</tt>, the address information is always
about the sender.
Since the Underlay is responsible to determine the
peer ID of the sender for all messages, it would thus be
redundant to also include the peer ID in the
<tt>HelloMessage</tt>.-->
<!-- FIXME: Are hello messages should or must?
Only recommended not, if so, what if that is not done?
-->
Initiation of <tt>HelloMessages</tt> by the implementation itself is <bcp14>RECOMMENDED</bcp14>.
<tt>HelloMessage</tt>s are used to inform neighbors of
a peer about the sender's available addresses. The
recipients use these messages to inform their respective
Underlays about ways to sustain the connections and to
generate HELLO blocks (see <xref target="hello_block"/>)
to answer peer discovery queries
from other peers.
</t>
<section anchor="p2p_hello_wire">
<name>Wire Format</name>
<figure anchor="figure_hellomsg" title="The HelloMessage Wire Format.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| MSIZE | MTYPE | RESERVED | URL_CTR |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE /
/ (64 byte) |
+-----+-----+-----+-----+-----+-----+-----+-----+
| EXPIRATION |
+-----+-----+-----+-----+-----+-----+-----+-----+
/ ADDRESSES (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>where:</t>
<dl>
<dt>MSIZE</dt>
<dd>
denotes the size of this message in network byte order.
</dd>
<dt>MTYPE</dt>
<dd>
is the 16-bit message type. It must be set to
the value 157 in network byte order.
</dd>
<dt>RESERVED</dt>
<dd>
is a 16-bit field that must be zero.
</dd>
<dt>URL_CTR</dt>
<dd>
is a 16-bit number that gives the total number of
addresses encoded in the ADDRESSES field.
In network byte order.
</dd>
<dt>SIGNATURE</dt>
<dd>
is a 64 byte EdDSA signature using the sender's private
key affirming the information contained in the message.
The signature is signing exactly the same data that is being
signed in a HELLO block as described in <xref target="hello_block"/>.
</dd>
<dt>EXPIRATION</dt>
<dd>
denotes the absolute 64-bit expiration date of the content.
The value specified is microseconds since midnight (0 hour),
January 1, 1970, but must be a multiple of one million
(so that it can be represented in seconds in a HELLO URL).
Stored in network byte order.
</dd>
<dt>ADDRESSES</dt>
<dd>
A sequence of exactly URL_CTR
addresses (<xref target="terminology"/>)
which can be used to contact the peer.
Each address <bcp14>MUST</bcp14> be 0-terminated.
The set of addresses <bcp14>MAY</bcp14> be empty.
</dd>
</dl>
</section>
<section anchor="p2p_hello_processing">
<name>Processing</name>
<t>
Upon receiving a <tt>HelloMessage</tt> from a peer <tt>P</tt>
an implementation <bcp14>MUST</bcp14> process it step by step as follows:
</t>
<ol>
<li>
If <tt>P</tt> is not in its routing table, the message
is discarded.
</li>
<li>
The signature is verified, including a check that the expiration time
is in the future. If the signature is invalid, the message is discarded.
</li>
<li>
The HELLO information is cached in the routing table until it expires,
the peer is removed from the routing table, or the information is replaced by another message from the peer.
</li>
<li>
<!-- We need a statement like this here. Should they? Can they? What if they are (not)? -->
<tt>HelloMessages</tt> <bcp14>SHOULD NOT</bcp14> be forwarded.
</li>
</ol>
<t>
<!-- FIXME: How? What function of the underlay? The underlay also already knows so...
what is the point? -->
The address information about P should then also be made
available to each respective Underlays to enable the
Underlay to maintain optimal connectivity to the
neighbor.
</t>
</section>
</section>
<section anchor="p2p_put" numbered="true" toc="default">
<name>PutMessage</name>
<t>
<tt>PutMessage</tt>s are used to store information at other peers in the DHT.
Any API which allows applications to initiate <tt>PutMessage</tt>s needs to
provide sufficient, implementation-specific information needed to construct
the initial <tt>PutMessage</tt>.
For example, implementations supporting multiple applications and blocks will
have block type and message flag parameters in addition to the actual data
payload and key.
</t>
<section anchor="p2p_put_wire">
<name>Wire Format</name>
<figure anchor="figure_putmsg" title="The PutMessage Wire Format.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| MSIZE | MTYPE | BTYPE |
+-----+-----+-----+-----+-----+-----+-----+-----+
| FLAGS | HOPCOUNT | REPL_LVL | PATH_LEN |
+-----+-----+-----+-----+-----+-----+-----+-----+
| EXPIRATION |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER_BF /
/ (128 byte) |
+-----+-----+-----+-----+-----+-----+-----+-----+
| BLOCK_KEY /
/ (64 byte) |
+-----+-----+-----+-----+-----+-----+-----+-----+
/ TRUNCATED ORIGIN (0 or 32 bytes) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ PUTPATH (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ LAST HOP SIGNATURE (0 or 64 bytes) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ BLOCK (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>where:</t>
<dl>
<dt>MSIZE</dt>
<dd>
denotes the size of this message in network byte order.
Set by the initiator.
Modified by processing peers when message contents such as the path lengths change.
</dd>
<dt>MTYPE</dt>
<dd>
is the 16-bit message type. It is set by the initiator to
the value 146 in network byte order. Read-only.
</dd>
<dt>BTYPE</dt>
<dd>
is a 32-bit block type.
The block type indicates the content
type of the payload.
Set by the initiator. Read-only.
In network byte order.
</dd>
<dt>FLAGS</dt>
<dd>
is a 16-bit vector with binary options (see <xref target="route_flags"/>).
Set by the initiator. Read-only.
</dd>
<dt>HOPCOUNT</dt>
<dd>
is a 16-bit number indicating how many hops this message has
traversed to far.
Set by the initiator to 0.
Incremented by processing peers.
In network byte order.
</dd>
<dt>REPL_LVL</dt>
<dd>
is a 16-bit number indicating the desired replication level of
the data.
Set by the initiator. Read-only.
In network byte order.
</dd>
<dt>PATH_LEN</dt>
<dd>
is a 16-bit number indicating the number of Path Elements
recorded in PUTPATH.
As PUTPATH is optional, this value may be zero.
If the PUTPATH is enabled, set initially to 0 by the initiator.
Incremented by processing peers.
In network byte order.
</dd>
<dt>EXPIRATION</dt>
<dd>
denotes the absolute 64-bit expiration date of the content.
Set by the initiator. Read-only.
In microseconds since midnight (0 hour), January 1, 1970 in network
byte order.
</dd>
<dt>PEER_BF</dt>
<dd>
A peer Bloom filter to stop circular routes (see <xref target="routing_bloomfilter"/>).
Set by the initiator to include itself. <!-- FIXME: Is that so? -->
Modified by processing peers to include their own peer ID.
</dd>
<dt>BLOCK_KEY</dt>
<dd>
The key under which the <tt>PutMessage</tt> wants to store content
under.
Set by the initiator. Read-only.
</dd>
<dt>TRUNCATED ORIGIN</dt>
<dd>
is only provided if the TRUNCATED flag
is set in FLAGS. If present, this is
the public key of the peer just before
the first entry on the PUTPATH and the
first peer on the PUTPATH is not the
actual origin of the message. Thus, to
verify the first signature on the PUTPATH,
this public key must be used. Note that
due to the truncation, this last hop
cannot be verified to exist.
Value is modified by processing peers.
</dd>
<dt>PUTPATH</dt>
<dd>
the variable-length PUT path.
The path consists of a list of PATH_LEN Path Elements.
Set by the initiator to 0.
Incremented by processing peers.
</dd>
<dt>LAST HOP SIGNATURE</dt>
<dd>
is only provided if the RECORD ROUTE flag
is set in FLAGS. If present, this is
an EdDSA signature of the sender of this message
(using the same format as the signatures in PUTPATH)
affirming that the sender forwarded the message from
the predecessor (all zeros if PATH_LEN is 0,
otherwise the last peer in PUTPATH) to
the target peer.
Modified by processing peers (if flag is set).
</dd>
<dt>BLOCK</dt>
<dd>
the variable-length block payload. The contents are determined
by the BTYPE field. The length is determined by MSIZE minus
the size of all of the other fields.
Set by the initiator. Read-only.
</dd>
</dl>
</section>
<section anchor="p2p_put_processing">
<name>Processing</name>
<t>
Upon receiving a <tt>PutMessage</tt> from a peer <tt>P</tt>
, or created through initiation by an overlay API,
an implementation <bcp14>MUST</bcp14> process it step by step as follows:
</t>
<ol>
<li>
The <tt>EXPIRATION</tt> field is evaluated.
If the message is expired, it <bcp14>MUST</bcp14> be discarded.
</li>
<li>
If the <tt>BTYPE</tt> is not supported by the implementation,
no validation of the block payload is performed and processing
continues at (5).
If the <tt>BTYPE</tt> is <tt>ANY</tt>, then the message <bcp14>MUST</bcp14> be discarded.
Else, the block <bcp14>MUST</bcp14> be validated as defined in (3) and (4).
</li>
<li>
The message is evaluated using the block validation functions matching
the <tt>BTYPE</tt>. First, the client attempts to
derive the key using the respective <tt>DeriveBlockKey</tt> procedure
as described in <xref target="block_functions"/>. If a key can be
derived and does not match, the message <bcp14>MUST</bcp14> be discarded.
</li>
<li>
Next, the <tt>ValidateBlockStoreRequest</tt> procedure for the <tt>BTYPE</tt>
as described in <xref target="block_functions"/> is used to
validate the block payload. If the block payload
is invalid, the message <bcp14>MUST</bcp14> be discarded.
</li>
<li>
The peer address of the sender peer <tt>P</tt> <bcp14>SHOULD</bcp14> be in <tt>PEER_BF</tt>.
If not, the implementation <bcp14>MAY</bcp14> log an error, but <bcp14>MUST</bcp14> continue.
</li>
<!--<li>
FIXME: I do not know what is going on here. "local peer address"
is certainly wrong.
But then, looking at the PathElement description, so wold be local peer ID.
Also, no signatures are ever created in this processing
If the <tt>RecordRoute</tt> flag is set in FLAGS,
the local peer address <bcp14>MUST</bcp14> be appended to the <tt>PUTPATH</tt>
of the message.
</li>-->
<li>
If the flag is not set, the <tt>PATH_LEN</tt>
<bcp14>MUST</bcp14> be set to zero.
If the <tt>PATH_LEN</tt> is non-zero,
the local peer <bcp14>SHOULD</bcp14> verify the signatures from the <tt>PUTPATH</tt>.
Verification <bcp14>MAY</bcp14> involve checking all signatures or any random
subset of the signatures. It is <bcp14>RECOMMENDED</bcp14> that peers adapt
their behavior to available computational resources so as to not make signature
verification a bottleneck. If an invalid signature is found, the
<tt>PUTPATH</tt> <bcp14>MUST</bcp14> be truncated to only include the elements
following the invalid signature.
</li>
<li>
If the local peer is the closest peer
(cf. <tt>IsClosestPeer(SELF, BLOCK_KEY, PeerFilter)</tt>) or the <tt>DemultiplexEverywhere</tt>
flag ist set, the message <bcp14>MUST</bcp14>
be stored locally in the block storage.
</li>
<li>
If the <tt>BTYPE</tt> of the message indicates a HELLO block, the
peer <bcp14>MUST</bcp14> be considered for the local routing
table if the respective k-bucket is not yet full. In this case,
the local peer <bcp14>MUST</bcp14> try to establish a connection
to the peer indicated in the HELLO block using the address information
from the HELLO block. If a connection is established,
the peer is added to the respective k-bucket of the routing table.
Note that the k-bucket <bcp14>MUST</bcp14> be determined by the
key computed using <tt>DeriveBlockKey</tt> and not by the <tt>QUERY_HASH</tt>.
</li>
<li>
Given the value in <tt>REPL_LVL</tt>, <tt>HOPCOUNT</tt> and the
result of <tt>IsClosestPeer(SELF, BLOCK_KEY, PeerFilter)</tt> the number of peers to
forward to <bcp14>MUST</bcp14> be calculated
using <tt>ComputeOutDegree()</tt>.
The implementation <bcp14>SHOULD</bcp14> select up to this
number of peers to forward the message to. The implementation <bcp14>MAY</bcp14>
forward to fewer or no peers in order to handle resource constraints
such as limited bandwidth.
<!-- FIXME: From above. it seems to me that here we need to add a new putpath
signature element (as we know the successor now)-->
For each selected peer with peer address <tt>P</tt> a dedicated <tt>PutMessage'</tt>
is created containing the original (and where applicable already updated) fields
of the received <tt>PutMessage</tt>.
In each message the local peer address <bcp14>MUST</bcp14> be added to the
<tt>PEER_BF</tt> and the <tt>HOPCOUNT</tt> is incremented by 1.
If the <tt>RecordRoute</tt> flag is set, a new Path Element is created and added
to the <tt>PUTPATH</tt> fields and the <tt>PATH_LEN</tt> field is incremented by 1.
Finally, <tt>SEND(P, PutMessage')</tt> is called.
</li>
</ol>
</section>
</section>
<section anchor="p2p_get" numbered="true" toc="default">
<name>GetMessage</name>
<t>
<tt>GetMessage</tt>s are used to request information from other peers in the DHT.
Any overlay API which allows applications to initiate <tt>GetMessage</tt>s needs to provide
sufficient, implementation-specific information needed to construct the initial <tt>GetMessage</tt>.
For example, implementations supporting multiple applications and blocks will have block type and
message flag parameters.
</t>
<section anchor="p2p_get_wire">
<name>Wire Format</name>
<figure anchor="figure_getmsg" title="The GetMessage Wire Format.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| MSIZE | MTYPE | BTYPE |
+-----+-----+-----+-----+-----+-----+-----+-----+
| FLAGS | HOPCOUNT | REPL_LVL | RF_SIZE |
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER_BF /
/ (128 byte) |
+-----+-----+-----+-----+-----+-----+-----+-----+
| QUERY_HASH /
/ (64 byte) |
+-----+-----+-----+-----+-----+-----+-----+-----+
| RESULT_FILTER /
/ (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ XQUERY (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>where:</t>
<dl>
<dt>MSIZE</dt>
<dd>
denotes the size of this message in network byte order.
Set by the initiator.
<!-- FIXME: Is this not fixed-length once set by initiator?-->
Modified by processing peers when message contents change.
</dd>
<dt>MTYPE</dt>
<dd>
is the 16-bit message type. It is set by the initiator to
the value 147 in network byte order. Read-only.
</dd>
<dt>BTYPE</dt>
<dd>
is a 32-bit block type field. The block type indicates the content
type of the payload. Set by the initiator. Read-only. In network byte order.
</dd>
<dt>FLAGS</dt>
<dd>
is a 16-bit vector with binary options (see <xref target="route_flags"/>).
Set by the initiator. Read-only.
</dd>
<dt>HOPCOUNT</dt>
<dd>
is a 16-bit number indicating how many hops this message has
traversed to far.
Set by the initiator to 0.
Incremented by processing peers.
In network byte order.
</dd>
<dt>REPL_LVL</dt>
<dd>
is a 16-bit number indicating the desired replication level of
the data.
Set by the initiator. Read-only.
In network byte order.
</dd>
<dt>RF_SIZE</dt>
<dd>
is a 16-bit number indicating the length of the
result filter RESULT_FILTER.
Set by the initiator. Read-only. <!--FIXME is that so? -->
In network byte order.
</dd>
<dt>PEER_BF</dt>
<dd>
A peer Bloom filter to stop circular routes (see <xref target="routing_bloomfilter"/>).
Set by the initiator to include itself. <!-- Is that so? -->
Modified by processing peers to include their own peer address.
</dd>
<dt>QUERY_HASH</dt>
<dd>
The query used to indicate what the key is under which the initiator is looking
for blocks with this request.
The block type may use a different evaluation logic to determine
applicable result blocks.
Set by the initiator. Read-only.
</dd>
<dt>RESULT_FILTER</dt>
<dd>
the variable-length result filter, described in <xref target="result_filter"/>.
Set by the initiator.
Modified by processing peers.
</dd>
<dt>XQUERY</dt>
<dd>
the variable-length extended query. Optional.
Set by the initiator. <!-- FIXME: Modified by processing peers? -->
</dd>
</dl>
</section>
<section anchor="result_filter">
<name>Result Filter</name>
<t>
The result filter is used to indicate to other peers which results
are not of interest when processing a <tt>GetMessage</tt>
(<xref target="p2p_get"/>).
Any peer which is processing <tt>GetMessage</tt>s and has a result
which matches the query key <bcp14>MUST</bcp14> check the result filter
and only send a reply message if the result does not test positive
under the result filter. Before forwarding the <tt>GetMessage</tt>, the
result filter <bcp14>MUST</bcp14> be updated using the result of the <tt>BTYPE</tt>-specific
<tt>FilterResult</tt> (see <xref target="block_functions"/>) function to filter
out all results already returned by the local peer.
</t>
<t>
How a result filter is implemented depends on the block type
as described in <xref target="block_functions"/>.
Result filters may be probabilistic data structures. Thus,
it is possible that a desireable result is filtered by a result
filter because of a false-positive test.
</t>
<t>
<!-- FIXME: then this should be part of the registration policy -->
How exactly a block result is added to a result filter
<bcp14>MUST</bcp14> be specified as part of the definition of a block type.
</t>
</section>
<section anchor="p2p_get_processing">
<name>Processing</name>
<t>
Upon receiving a <tt>GetMessage</tt> from a peer <tt>P</tt>, or
created through initiation by the overlay API, an
implementation <bcp14>MUST</bcp14> process it step by step as follows:
</t>
<ol>
<li>
The <tt>QUERY_HASH</tt> and <tt>XQUERY</tt> fields are validated
against the requested <tt>BTYPE</tt> as defined by its respective
<tt>ValidateBlockQuery</tt> procedure.
If validation
function yields <tt>REQUEST_INVALID</tt>, the message <bcp14>MUST</bcp14> be discarded.
If the <tt>BTYPE</tt> is not supported, the message <bcp14>MUST</bcp14>
be forwarded (Skip to step 4).
If the <tt>BTYPE</tt> is <tt>ANY</tt>, the message is processed
without validation.
</li>
<li>
The peer address of the sender peer <tt>P</tt> <bcp14>SHOULD</bcp14> be in the
<tt>PEER_BF</tt> Bloom filter. If not, the
implementation <bcp14>MAY</bcp14> log an error, but <bcp14>MUST</bcp14> continue.
</li>
<li>
<t>
If the local peer is the closest peer
(cf. <tt>IsClosestPeer (SELF, QueryHash, PeerFilter)</tt>) or the
<tt>DemultiplexEverywhere</tt> flag is set, a reply
<bcp14>MUST</bcp14> be produced (if one is available) using the following
steps:
</t>
<ol type="%c)">
<!-- FIXME: It is not clear that this is a fallthrough statement -->
<!-- FIXME: Are HELLO blocks according to the spec stored in block storage but never looked for? -->
<li>
If <tt>BTYPE</tt> indicates a request for a HELLO block or
<tt>ANY</tt>,
the peer <bcp14>MUST</bcp14> consult its own HELLO as well as
the HELLOs it has cached for the
peers in its routing table instead of the local block
storage (while continuing to respect flags like
<tt>DemultiplexEverywhere</tt>
and <tt>FindApproximate</tt>).
</li>
<li>
If <tt>FLAGS</tt> indicate a <tt>FindApproximate</tt> request,
the peer <bcp14>SHOULD</bcp14> try to respond with the closest block it
has that is not filtered by the
<tt>RESULT_BF</tt>.
</li>
<li>
Else, the peer <bcp14>MUST</bcp14> respond if it has a valid block
that matches the key exactly and that is
not filtered by the <tt>RESULT_BF</tt>.
</li>
</ol>
<t>
Any such resulting block <bcp14>MUST</bcp14> be encapsulated in a
<tt>ResultMessage</tt> and <bcp14>SHOULD</bcp14> be transmitted to the
neighbor from which the request was received. Implementations
<bcp14>MAY</bcp14> drop messages if they are resource-constrained.
However, <tt>ResultMessage</tt>s <bcp14>SHOULD</bcp14> be given the
highest priority among competing transmissions.
</t>
<t>
If the <tt>BTYPE</tt> is supported and <tt>ValidateBlockReply</tt> for the given
query has yielded a status of <tt>FILTER_LAST</tt>, processing
<bcp14>MUST</bcp14> end and not continue with forwarding of
the request to other peers.
</t>
</li>
<li>
Given the value in <tt>REPL_LVL</tt>, the number of peers to forward to
<bcp14>MUST</bcp14> be calculated using
<tt>ComputeOutDegree()</tt>.
If there is at least one
peer to forward to, the implementation <bcp14>SHOULD</bcp14> select up to this
number of peers to forward the message to. The implementation <bcp14>MAY</bcp14>
forward to fewer or no peers in order to handle resource constraints
such as bandwidth.
The peer Bloom filter <tt>PEER_BF</tt> <bcp14>MUST</bcp14> be updated with the local
peer address <tt>SELF</tt>.
For all peers with peer address <tt>P'</tt> chosen to forward the message
to, <tt>SEND(P', GetMessage')</tt> is called. Here, <tt>GetMessage'</tt>
is the original message with the updated fields for <tt>HOPCOUNT</tt> (incremented
by 1), <tt>PEER_BF</tt> and <tt>RESULT_FILTER</tt>.
</li>
</ol>
</section>
</section>
<section anchor="p2p_result" numbered="true" toc="default">
<name>ResultMessage</name>
<t>
<tt>ResultMessage</tt>s are used to return information to other peers in the DHT
or to applications using the overlay API that previously initiated a <tt>GetMessage</tt>.
The initiator of a <tt>ResultMessage</tt> is a peer triggered through the processing
of a <tt>GetMessage</tt>.
</t>
<section anchor="p2p_result_wire">
<name>Wire Format</name>
<figure anchor="figure_resmsg" title="The ResultMessage Wire Format">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| MSIZE | MTYPE | BTYPE |
+-----+-----+-----+-----+-----+-----+-----+-----+
| RESERVED | FLAGS | PUTPATH_L | GETPATH_L |
+-----+-----+-----+-----+-----+-----+-----+-----+
| EXPIRATION |
+-----+-----+-----+-----+-----+-----+-----+-----+
| QUERY_HASH /
/ (64 byte) |
+-----+-----+-----+-----+-----+-----+-----+-----+
/ TRUNCATED ORIGIN (0 or 32 bytes) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ PUTPATH /
/ (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ GETPATH /
/ (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ LAST HOP SIGNATURE (0 or 64 bytes) /
+-----+-----+-----+-----+-----+-----+-----+-----+
/ BLOCK /
/ (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>where:</t>
<dl>
<dt>MSIZE</dt>
<dd>
denotes the size of this message in network byte order.
Set by the initiator.
Updated by processing peers.
</dd>
<dt>MTYPE</dt>
<dd>
is the 16-bit message type. It must be set to
the value 148 in network byte order.
Set by the initiator. Read-only.
</dd>
<dt>BTYPE</dt>
<dd>
is a 32-bit block type field. The block type indicates the content
type of the payload.
Set by the initiator. Read-only.
In network byte order.
</dd>
<dt>RESERVED</dt>
<dd>
is a 16-bit value. Implementations <bcp14>MUST</bcp14>
set this value to zero when originating a result message.
Implementations <bcp14>MUST</bcp14> forward
this value unchanged even if it is non-zero.
</dd>
<dt>FLAGS</dt>
<dd>
is a 16-bit vector with binary options (see <xref target="route_flags"/>).
Set by the initiator. <!-- FIXME to what? -->
</dd>
<dt>PUTPATH_L</dt>
<dd>
is a 16-bit number indicating the number of Path Elements recorded
in <tt>PUTPATH</tt>. As <tt>PUTPATH</tt> is optional, this value may be zero
even if the message has traversed several peers.
Set by the initiator to the <tt>PATH_LEN</tt> of the <tt>PutMessage</tt>
from which the block originated.
Modified by processing peers in case of path truncation.
In network byte order.
</dd>
<dt>GETPATH_L</dt>
<dd>
is a 16-bit number indicating the number of Path Elements
recorded in <tt>GETPATH</tt>. As <tt>GETPATH</tt> is optional, this value may be zero
even if the message has traversed several peers.
Set by the initiator to 0.
Modified by processing peers.
In network byte order.
</dd>
<dt>EXPIRATION</dt>
<dd>
denotes the absolute 64-bit expiration date of the content.
In microseconds since midnight (0 hour), January 1, 1970 in network
byte order.
Set by the initiator to the expiration value as recorded from
the <tt>PutMessage</tt> from which the block originated.
Read-only.
</dd>
<dt>QUERY_HASH</dt>
<dd>
the query hash corresponding to the <tt>GetMessage</tt> which
caused this reply message to be sent.
Set by the initiator using the value of the <tt>GetMessage</tt>.
Read-only.
</dd>
<dt>TRUNCATED ORIGIN</dt>
<dd>
is only provided if the TRUNCATED flag
is set in FLAGS. If present, this is
the public key of the peer just before
the first entry on the PUTPATH and the
first peer on the PUTPATH is not the
actual origin of the message. Thus, to
verify the first signature on the PUTPATH,
this public key must be used. Note that
due to the truncation, this last hop
cannot be verified to exist.
Set by processing peers.
</dd>
<dt>PUTPATH</dt>
<dd>
the variable-length PUT path.
The path consists of a list of <tt>PUTPATH_L</tt> Path Elements.
Set by the initiator to the the <tt>PUTPATH</tt> of the <tt>PutMessage</tt>
from which the block originated.
Modified by processing peers in case of path truncation.
</dd>
<dt>GETPATH</dt>
<dd>
the variable-length PUT path.
The path consists of a list of <tt>GETPATH_L</tt> Path Elements.
Set by processing peers.
</dd>
<dt>LAST HOP SIGNATURE</dt>
<dd>
is only provided if the <tt>RecordRoute</tt> flag
is set in FLAGS. If present, this is
an EdDSA signature of the sender of this message
(using the same format as the signatures in PUTPATH)
affirming that the sender forwarded the message from
the predecessor (all zeros if PATH_LEN is 0,
otherwise the last peer in PUTPATH) to
the target peer.
</dd>
<dt>BLOCK</dt>
<dd>
the variable-length resource record data payload.
The contents are defined by the respective type of the resource record.
Set by the initiator. Read-only.
</dd>
</dl>
</section>
<section anchor="p2p_result_processing">
<name>Processing</name>
<t>
Upon receiving a <tt>ResultMessage</tt> from a connected peer or
triggered by the processing of a <tt>GetMessage</tt>,
an implementation <bcp14>MUST</bcp14> process it step by step as follows:
</t>
<ol>
<li>
First, the <tt>EXPIRATION</tt> field is evaluated.
If the message is expired, it <bcp14>MUST</bcp14> be discarded.
</li>
<li>
If the <tt>BTYPE</tt> is supported, then the <tt>BLOCK</tt>
<bcp14>MUST</bcp14> be validated against the
requested <tt>BTYPE</tt>. To do this, the peer
checks that the block is valid using <tt>ValidateBlockStoreRequest</tt>.
If the result is <tt>BLOCK_INVALID</tt>, the message <bcp14>MUST</bcp14> be
discarded.
</li>
<li>
If the <tt>PUTPATH_L</tt> or the <tt>GETPATH_L</tt> are non-zero,
the local peer <bcp14>SHOULD</bcp14> verify the signatures from the <tt>PUTPATH</tt>
and the <tt>GETPATH</tt>.
Verification <bcp14>MAY</bcp14> involve checking all signatures or any random
subset of the signatures. It is <bcp14>RECOMMENDED</bcp14> that peers adapt
their behavior to available computational resources so as to not make signature
verification a bottleneck. If an invalid signature is found, the
path <bcp14>MUST</bcp14> be truncated to only include the elements
following the invalid signature. In particular, any invalid signature
on the <tt>GETPATH</tt> will cause <tt>PUTPATH_L</tt> to be set to 0.
</li>
<li>
The peer also attempts to compute the
key using <tt>DeriveBlockKey</tt>. This may result in <tt>NONE</tt>.
The result is used later. Note that even if a key was computed, it
does not have to match the <tt>QUERY_HASH</tt>.
</li>
<li>
If the <tt>BTYPE</tt> of the message indicates a HELLO block, the
peer <bcp14>MUST</bcp14> be considered for the local routing
table if the respective k-bucket is not yet full. In this case,
the local peer <bcp14>MUST</bcp14> try to establish a connection
to the peer indicated in the HELLO block using the address information
from the HELLO block. If a connection is established,
the peer is added to the respective k-bucket of the routing table.
Note that the k-bucket <bcp14>MUST</bcp14> be determined by the
key computed using <tt>DeriveBlockKey</tt> and not by the <tt>QUERY_HASH</tt>.
</li>
<li>
<t>
If the <tt>QUERY_HASH</tt> of this <tt>ResultMessage</tt> is found in the
list of pending local or remote queries, then
for each matching pending query:
</t>
<ol type="%c)">
<li>
If the <tt>FindApproximate</tt> flag was not set in the query and the <tt>BTYPE</tt> allowed the
implementation to compute the key from the block, the computed key must
exactly match the <tt>QUERY_HASH</tt>, otherwise the result does
not match the pending query and processing continues with the next pending query.
</li>
<li>
If the <tt>BTYPE</tt> is supported, result block <bcp14>MUST</bcp14>
be validated against the specific query using
the respective <tt>FilterBlockResult</tt> function. This function
<bcp14>MUST</bcp14> update
the result filter if a result is returned to the originator of the
query.
</li>
<li>
If the <tt>BTYPE</tt> is not supported, filtering of exact duplicate
replies <bcp14>MUST</bcp14> still be performed before forwarding
the reply.
Such duplicate filtering <bcp14>MAY</bcp14> be implemented
probabilistically, for example using a Bloom filter.
The result of this duplicate filtering is always either
<tt>FILTER_MORE</tt> or <tt>FILTER_DUPLICATE</tt>.
</li>
<li>
If the <tt>RecordRoute</tt> flag is set in FLAGS,
the local peer address <bcp14>MUST</bcp14> be appended to the <tt>GETPATH</tt>
of the message and the respective signature <bcp14>MUST</bcp14> be
set using the query origin as the <tt>PEER SUCCESSOR</tt> and the
response origin as the <tt>PEER PREDECESSOR</tt>. If the flag is not set,
the <tt>GETPATH_L</tt> and <tt>PUTPATH_L</tt>
<bcp14>MUST</bcp14> be set to zero when forwarding the result.
</li>
</ol>
<t>
If the result is either <tt>FILTER_MORE</tt> or <tt>FILTER_LAST</tt>,
the message is forwarded to the origin of the query through
either the overlay API or using <tt>SEND(P, ResultMessage')</tt> where
<tt>ResultMessage'</tt> is the now modified message.
If the result was <tt>FILTER_LAST</tt>, the query is removed from the list of pending
queries.
</t>
</li>
<li>
<!-- FIXME: If that is the case, then this also needs to be in the
processing of GetMessage: I think we shoul dmove this to some
performance considerations instead. -->
Finally, the implementation <bcp14>MAY</bcp14> choose to
cache data from <tt>ResultMessage</tt>s.
</li>
</ol>
</section>
</section>
</section>
<section anchor="blockstorage" numbered="true" toc="default">
<name>Blocks</name>
<t>
This section describes various considerations R<sup>5</sup>N
implementations must consider with respect to blocks.
Specifically, implementations <bcp14>SHOULD</bcp14> be able
to validate and persist blocks. Implementations
<bcp14>MAY</bcp14> not support validation for all types
of blocks. On some devices, storing blocks <bcp14>MAY</bcp14>
also be impossible due to lack of storage capacity.
</t>
<t>
Applications can and should define their own block types.
The block type determines the format and handling of the block
payload by peers in <tt>PutMessage</tt>s and <tt>ResultMessage</tt>s.
Block types <bcp14>MUST</bcp14> be registered with GANA
(see <xref target="gana_block_type"/>).
</t>
<t>
</t>
<section anchor="block_functions">
<name>Block Operations</name>
<t>
Block validation may be necessary for all types of DHT
messages. To enable these validations, any block type
specification <bcp14>MUST</bcp14> define the following functions:
</t>
<dl>
<dt>ValidateBlockQuery(Key, XQuery)
-> RequestEvaluationResult</dt>
<dd>
<t>
is used to evaluate the request for a block as part of
<tt>GetMessage</tt> processing. Here, the block payload is unkown,
but if possible the <tt>XQuery</tt> and <tt>Key</tt>
<bcp14>SHOULD</bcp14> be verified. Possible values for
the <tt>RequestEvaluationResult</tt> are:
</t>
<dl>
<dt>REQUEST_VALID</dt>
<dd>Query is valid.</dd>
<dt>REQUEST_INVALID</dt>
<dd>
Query format does not match block type. For example, a
mandatory XQuery was not provided, or of the size of
the XQuery is not appropriate for the block type.
</dd>
</dl>
</dd>
<dt>DeriveBlockKey(Block) -> Key | NONE</dt>
<dd>
is used to synthesize the block key from the block payload as
part of <tt>PutMessage</tt> and <tt>ResultMessage</tt> processing.
The special return value of <tt>NONE</tt> implies that this block type does not
permit deriving the key from the block. A Key may be returned
for a block that is ill-formed.
</dd>
<dt>ValidateBlockStoreRequest(Block)
-> BlockEvaluationResult</dt>
<dd>
<t>
is used to evaluate a block payload
as part of <tt>PutMessage</tt> and <tt>ResultMessage</tt> processing.
Possible values for the <tt>BlockEvaluationResult</tt> are:
</t>
<dl>
<dt>BLOCK_VALID</dt>
<dd>Block is valid.</dd>
<dt>BLOCK_INVALID</dt>
<dd>Block payload does not match the block type.
</dd>
</dl>
</dd>
<dt>SetupResultFilter(FilterSize, Mutator) -> RF</dt>
<dd>
is used to setup an empty result filter. The arguments
are the set of results that must be filtered at the
initiator, and a <tt>MUTATOR</tt> value which <bcp14>MAY</bcp14>
be used to deterministically re-randomize
probabilistic data structures. The specification <bcp14>MUST</bcp14>
also include the wire format for BF.
</dd>
<dt>FilterResult(Block, Key, RF, XQuery) -> (FilterEvaluationResult, RF')</dt>
<dd>
<t>
is used to filter results against specific queries. This function
does not check the validity of Block itself or that it matches the given key,
as this must have been checked earlier.
Thus, locally stored blocks from previously observed
<tt>ResultMessages</tt> and <tt>PutMessages</tt> use this
function to perform filtering based on the request parameters
of a particular GET operation.
Possible values for the <tt>FilterEvaluationResult</tt> are:
</t>
<dl>
<dt>FILTER_MORE</dt>
<dd>Valid result, and there may be more.</dd>
<dt>FILTER_LAST</dt>
<dd>Last possible valid result.</dd>
<dt>FILTER_DUPLICATE</dt>
<dd>Valid result, but duplicate (was filtered by the result filter).</dd>
<dt>FILTER_IRRELEVANT</dt>
<dd>Block does not satisfy the constraints imposed by the XQuery.</dd>
</dl>
<t>
If the main evaluation result is <tt>FILTER_MORE</tt>, the function also returns
an updated result filter where the block is added to the set of
filtered replies. An implementation is not expected to actually differenciate
between the <tt>FILTER_DUPLICATE</tt> and <tt>FILTER_IRRELEVANT</tt> return
values: in both cases the block is ignored for this query.
</t>
</dd>
</dl>
</section>
<section anchor="hello_block">
<name>HELLO Blocks</name>
<t>
For bootstrapping and peer discovery, the DHT implementation uses
its own block type called "HELLO". HELLO blocks are the only type
of block that <bcp14>MUST</bcp14> be supported by every
R<sup>5</sup>N implementation. A block with this block type
contains the peer ID of the peer that published the HELLO together
with a set of addresses of this peer. The key of a HELLO block
is the SHA-512 of the peer ID and thus the peer's address in the DHT.
</t>
<t>
The HELLO block type wire format is illustrated in
<xref target="figure_hello"/>. A query for block of type HELLO <bcp14>MUST NOT</bcp14>
include extended query data (XQuery). Any implementation
encountering a request for a HELLO with non-empty XQuery
data <bcp14>MUST</bcp14> consider the request invalid and ignore it.
</t>
<figure anchor="figure_hello" title="The HELLO Block Format.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| PEER-ID |
| (32 byte) |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIGNATURE |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
| EXPIRATION |
+-----+-----+-----+-----+-----+-----+-----+-----+
/ ADDRESSES /
/ (variable length) /
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<dl>
<dt>PEER-ID</dt>
<dd>
is the Peer-ID of the peer which has generated this HELLO.
</dd>
<dt>EXPIRATION</dt>
<dd>
denotes the absolute 64-bit expiration date of the content.
The value specified is microseconds since midnight (0 hour),
January 1, 1970, but must be a multiple of one million (so that it
can be represented in seconds in a HELLO URL).
Stored in network byte order.
</dd>
<dt>ADDRESSES</dt>
<dd>
is a list of UTF-8 addresses (<xref target="terminology"/>)
which can be used to contact the peer.
Each address <bcp14>MUST</bcp14> be 0-terminated.
The set of addresses MAY be empty.
</dd>
<dt>SIGNATURE</dt>
<dd>
<t>
is the signature of the HELLO.
It covers a 64-bit pseudo header
derived from the information in the HELLO block.
The pseudo header includes
the expiration time, a constant that uniquely
identifies the purpose of the signature,
and a hash over the addresses.
The wire format is illustrated
in <xref target="figure_hellowithpseudo"/>.
</t>
<figure anchor="figure_hellowithpseudo" title="The Wire Format of the HELLO for Signing.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| SIZE | PURPOSE |
+-----+-----+-----+-----+-----+-----+-----+-----+
| EXPIRATION |
+-----+-----+-----+-----+-----+-----+-----+-----+
| H_ADDRS |
| (64 byte) |
| |
| |
| |
| |
| |
| |
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<dl>
<dt>SIZE</dt>
<dd>
A 32-bit value containing the length of the signed data in bytes
in network byte order.
The length of the signed data <bcp14>MUST</bcp14> be 80 bytes.
</dd>
<dt>PURPOSE</dt>
<dd>
A 32-bit signature purpose flag. This field <bcp14>MUST</bcp14> be 7 (in network
byte order).
</dd>
<dt>EXPIRATION</dt>
<dd>
denotes the absolute 64-bit expiration date of the HELLO.
In microseconds since midnight (0 hour), January 1, 1970 in network
byte order.
</dd>
<dt>H_ADDRS</dt>
<dd>
a SHA-512 hash over the addresses in the HELLO.
H_ADDRS is generated over the ADDRESSES field
as provided in the HELLO block using SHA-512 <xref target="RFC4634"/>.
</dd>
</dl>
</dd>
</dl>
<t>
The HELLO block functions <bcp14>MUST</bcp14> be implemented
as follows:
</t>
<dl>
<dt>ValidateBlockQuery(Key, XQuery)
-> RequestEvaluationResult</dt>
<dd>
To validate a block query for a HELLO is to simply check that the XQuery is empty. If it is empty, REQUEST_VALID ist returned. Otherwise, REQUEST_INVALID.
</dd>
<dt>DeriveBlockKey(Block) -> Key | NONE</dt>
<dd>
To derive a block key for a HELLO is to simply
hash the peer ID from the HELLO. The result of this function
is always the SHA-512 hash over the PEER-ID.
</dd>
<dt>ValidateBlockStoreRequest(Block)
-> BlockEvaluationResult</dt>
<dd>
To validate a block store request is to verify
the EdDSA <tt>SIGNATURE</tt> over the hashed <tt>ADDRESSES</tt>
against the public key from the peer ID field.
If the signature is valid BLOCK_VALID is returned.
Otherwise BLOCK_INVALID.
</dd>
<dt>SetupResultFilter(FilterSize, Mutator) -> RF</dt>
<dd>
<t>
<!-- FIXME: I do not understand this. Isn't the element set E known
for HELLO blocks? Isn't it H_ADDRS XOR MUTATOR? -->
The RESULT_FILTER for HELLO blocks is implemented using a
Bloom filter following the definition from <xref target="bloom_filters"/>
and consists of a variable number of buckets <tt>L</tt>.
<tt>L</tt> depends on the number of elements <tt>|E|</tt> known to be filtered at the
initiator.
<!-- FIXME: Minimum space for used for what? There is no example given anywhere in the
spec how this becomes relevant. Again, this is not some abstract block,
but specifically a HELLO (see above). -->
If <tt>|E|</tt> is zero, the size <tt>L</tt> is set to 32 bits to provide some minimum
space (to be used by peers when forwarding the request after
returning local results).
Otherwise, <tt>L</tt> is set to the minimum of
2<sup>18</sup> bits (2<sup>15</sup> bytes) and the lowest power
of 2 that is strictly larger than <tt>2*K*|E|</tt> bits (<tt>K*|E|/4</tt> bytes).
</t>
<t>
The <tt>k</tt>-value for the Bloom filter is 16.
The elements used in the Bloom filter
consist of an XOR between the <tt>H_ADDRS</tt> field (as computed using
SHA-512 over the <tt>ADDRESSES</tt>) and the SHA-512
hash of the <tt>MUTATOR</tt> field from a given HELLO block.
The mapping function M(<tt>H_ADDRS XOR MUTATOR</tt>) is defined as follows:
</t>
<t>
<tt>M(e = H_ADDR XOR MUTATOR) -> e as uint32[]</tt>
</t>
<t>
<tt>M</tt> is an identity function and returns the 512-bit XOR result unmodified.
This resulting byte string is interpreted as k=16 32-bit
integers in network byte order which are used to set and check the bucket bits in
<tt>B</tt> using <tt>BF-SET</tt> and <tt>BF-TEST</tt>.
The 32-bit Mutator is prepended to the L-bit Bloom filter bucket field <tt>HELLO_BF</tt> containing <tt>B</tt>
to create the result filter for a HELLO block:
</t>
<figure anchor="hello_rf" title="The HELLO Block Result Filter.">
<artwork name="" type="" align="left" alt=""><![CDATA[
0 8 16 24 32 40 48 56
+-----+-----+-----+-----+-----+-----+-----+-----+
| MUTATOR | HELLO_BF /
+-----+-----+-----+-----+ (variable length) /
/ /
+-----+-----+-----+-----+-----+-----+-----+-----+
]]></artwork>
</figure>
<t>where:</t>
<dl>
<dt>MUTATOR</dt>
<dd>
The 32-bit mutator for the result filter.
</dd>
<dt>HELLO_BF</dt>
<dd>
The L-bit Bloom filter buckets byte array.
</dd>
</dl>
<t>
The <tt>MUTATOR</tt> value is used
to additionally "randomize" the computation of the Bloom filter while
remaining deterministic across peers.
It is only ever set by the peer initiating the GET
request, and changed every time the GET request is repeated.
Peers forwarding GET requests <bcp14>MUST</bcp14> not change the
mutator value included in the <tt>RESULT_FILTER</tt> as they might not
be able to recalculate the result filter with a different <tt>MUTATOR</tt>
value.
</t>
<t>
Consequently, repeated
requests have statistically independent probabilities of creating
false-positives in a result filter. Thus, even if for one request
a result filter may exclude a result as a false-positive
match, subsequent requests are likely to not have the same
false-positives.
</t>
<t>
HELLO result filters can be merged if the
Bloom filters have the same size and
<tt>MUTATOR</tt> by setting all bits to 1 that are
set in either Bloom filter. This is done whenever
a peer receives a query with the same <tt>MUTATOR</tt>,
predecessor and Bloom filter size.
</t>
</dd>
<dt>FilterResult(Block, Key, RF, XQuery) -> (FilterEvaluationResult, RF')</dt>
<dd>
The <tt>H_ADDRS</tt> field is XORed with the SHA-512
hash of the <tt>MUTATOR</tt> field from the HELLO block and the resulting
value is checked against the Bloom filter in RF.
Consequently, HELLOs with completely identical sets of
addresses will be filtered and FILTER_DUPLICATE is returned.
Any small variation in the set of addresses will cause the block
to no longer be filtered (with high probability) and
FILTER_MORE is returned.
</dd>
</dl>
</section>
<section>
<name>Persistence</name>
<t>
An implementation <bcp14>SHOULD</bcp14> provide a local persistence mechanism for
blocks. Embedded systems that lack storage capability <bcp14>MAY</bcp14> use
connection-level signalling to indicate that they are merely a client utilizing a
DHT and are not able to participate with storage.
The local storage <bcp14>MUST</bcp14> provide the following functionality:
</t>
<dl>
<dt>Store(Key, Block)</dt>
<dd>
Stores a block under the specified key. If an block with identical
payload exists already under the same key, the meta data should
be set to the maximum expiration time of both blocks and use the
corresponding <tt>PUTPATH</tt> (and if applicable
<tt>TRUNCATED ORIGIN</tt>) of that version of the block.
</dd>
<dt>Lookup(Key) -> List of Blocks</dt>
<dd>
Retrieves blocks stored under the specified key.
</dd>
<dt>LookupApproximate(Key) -> List of Blocks</dt>
<dd>
Retrieves the blocks stored under the specified key and
any blocks under keys close to the specified key, in order
of decreasing proximity.
</dd>
</dl>
<section>
<name>Approximate Search Considerations</name>
<t>
Over time a peer may accumulate a significant number of blocks
which are stored locally in the persistence layer.
Due to the expected high number of blocks, the method to
retrieve blocks close to the specified lookup key in the
<tt>LookupApproximate</tt> API must be implemented with care
with respect to efficiency.
</t>
<t>
It is <bcp14>RECOMMENDED</bcp14> to limit the number of results
from the <tt>LookupApproximate</tt> procedure to a result size
which is easily manageable by the local system.
</t>
<t>
In order to efficiently find a suitable result set, the implementation
<bcp14>SHOULD</bcp14> follow the following procedure:
</t>
<ol>
<li>
Sort all blocks by the block key in ascending (decending) order.
The block keys are interpreted as integer.
</li>
<li>
Alternatingly select a block with a key larger and smaller from
the sortings.
The resulting set is sorted by XOR distance.
The selection process continues until the upper bound for the
result set is reached and both sortings do not yield any closer
blocks.
</li>
</ol>
<t>
An implementation <bcp14>MAY</bcp14> decide to use a custom algorithm in order to
find the closest blocks in the local storage.
But, especially for more primitive approaches, such as only
comparing XOR distances for all blocks in the storage, the
procedure may become ineffective for large storages.
</t>
</section>
<section>
<name>Caching Strategy Considerations</name>
<t>
An implementation <bcp14>MUST</bcp14> implement an eviction strategy
for blocks stored in the block storage layer.
</t>
<t>
In order to ensure the freshness of blocks, an implementation
<bcp14>MUST</bcp14> evict expired blocks in favor of
new blocks.
</t>
<t>
An implementation <bcp14>MAY</bcp14> preserve blocks which are often
requested.
This approach can be expensive as it requires the implementation
to keep track of how often a block is requested.
</t>
<t>
An implementation <bcp14>MAY</bcp14> preserve blocks which are close
to the local peer ID.
</t>
<t>
An implementation <bcp14>MAY</bcp14> provide configurable storage
quotas and
adapt its eviction strategy based on the current storage size
or other constrained resources.
</t>
</section>
</section>
</section>
<section anchor="security" numbered="true" toc="default">
<name>Security Considerations</name>
<!-- FIXME: Here we should (again) discuss how the system is open and
does not have/require a trust anchor a priori. This is (again) in contrast
to RELOAD -->
<t>
If an upper bound to the maximum number of neighbors in a
k-bucket is reached, the implementation <bcp14>MUST</bcp14>
prefer to preserve the oldest working connections instead of
new connections. This makes Sybil attacks less effective
as an adversary would have to invest more resources over time
to mount an effective attack.
</t>
<t>
The <tt>ComputeOutDegree</tt> function limits the
<tt>REPL_LVL</tt> to a maximum of 16. This imposes
an upper limit on bandwidth amplification an attacker
may achieve for a given network size and topology.
</t>
<section>
<name>Approximate Result Filtering</name>
<t>
When a FindApproximate request is encountered, a peer will try to
respond with the closest block it has that is not filtered by the
result bloom filter.
Implementations <bcp14>MUST</bcp14> ensure that
the cost of evaluating any such query is reasonably small.
For example, implementations <bcp14>MAY</bcp14> consider to
avoid an exhaustive search of their database.
Not doing so can lead to denial of service attacks as there
could be cases where too many local results are
filtered by the result filter.
</t>
</section>
<section>
<name>Access control</name>
<t>
By design R<sup>5</sup>N does not rely on strict admission control through
the use of either centralized enrollment servers or pre-shared keys.
This is a key distintion over protocols that do rely on this kind of access
control such as <xref target="RFC6940"/> which, like R<sup>5</sup>N, provides
a peer-to-peer (P2P) signaling protocol with extensible routing and topology
mechanisms.
<!-- FIXME: Can we give examples here? -->
Some decentralized applications require a more open system that
enables ad-hoc participation and other means to prevent common attacks
on P2P overlays.
</t>
</section>
</section>
<section anchor="iana" numbered="true" toc="default">
<name>IANA Considerations</name>
<t>
IANA maintains a registry called the "Uniform Resource Identifier
(URI) Schemes" registry.
</t>
<!-- NOTE: not sure we should do this here, we could also
register gnunet:// independent of this RFC already.
Might be a good idea! -->
<section anchor="gnunet-uri" title="GNUnet URI Scheme Registration">
<t>
IANA maintains the "Uniform Resource Identifier (URI)
Schemes" registry. The registry should be updated to include
an entry for the 'gnunet' URI scheme. IANA is requested to
update that entry to reference this document when published
as an RFC.
</t>
</section>
<section anchor="r5n-uri" title="R5N URI Scheme Registration">
<t>
IANA maintains the "Uniform Resource Identifier (URI)
Schemes" registry. The registry should be updated to include
an entry for the 'r5n+udp+ip' URI scheme. IANA is requested to
update that entry to reference this document when published
as an RFC.
</t>
</section>
</section>
<section anchor="gana" numbered="true" toc="default">
<name>GANA Considerations</name>
<section anchor="gana_block_type" numbered="true" toc="default">
<name>Block Type Registry</name>
<t>
GANA <xref target="GANA"/>
is requested to create a "DHT Block Types" registry.
The registry shall record for each entry:
</t>
<ul>
<li>Name: The name of the block type (case-insensitive ASCII
string, restricted to alphanumeric characters</li>
<li>Number: 32-bit</li>
<li>Comment: Optionally, a brief English text describing the purpose of
the block type (in UTF-8)</li>
<li>Contact: Optionally, the contact information of a person to contact for
further information</li>
<li>References: Optionally, references describing the record type
(such as an RFC)</li>
</ul>
<t>
The registration policy for this sub-registry is "First Come First
Served", as described in <xref target="RFC8126"/>.
GANA created the registry as follows:
</t>
<figure anchor="figure_btypenums" title="The Block Type Registry.">
<artwork name="" type="" align="left" alt=""><![CDATA[
Number| Name | Contact | References | Description
------+----------------+---------+------------+-------------------------
0 ANY N/A [This.I-D] Reserved
11 GNS_NAMERECORD N/A GNS Block for GNS records
13 DHT_HELLO N/A [This.I-D] Address data for a peer
]]></artwork>
</figure>
</section>
<section anchor="gana_gnunet_url" numbered="true" toc="default">
<name>GNUnet URI schema Subregistry</name>
<t>
GANA <xref target="GANA"/>
is requested to create a "gnunet://" sub-registry.
The registry shall record for each entry:
</t>
<ul>
<li>Name: The name of the subsystem (case-insensitive ASCII
string, restricted to alphanumeric characters</li>
<li>Comment: Optionally, a brief English text describing the purpose of
the subsystem (in UTF-8)</li>
<li>Contact: Optionally, the contact information of a person to contact for
further information</li>
<li>References: Optionally, references describing the syntax of the URL
(such as an RFC or LSD)</li>
</ul>
<t>
<!-- FIXME: See GNS wording for this which is already improved / ISE compliant -->
The registration policy for this sub-registry is "First Come First
Served", as described in <xref target="RFC8126"/>.
GANA is requested to populate this registry as follows:
</t>
<figure anchor="figure_gnunetscheme" title="GNUnet scheme Subregistry.">
<artwork name="" type="" align="left" alt=""><![CDATA[
Name | Contact | References | Description
---------------+---------+------------+-------------------------
HELLO N/A [This.I-D] How to contact a peer.
ADDRESS N/A N/A Network address.
]]></artwork>
</figure>
</section>
<section anchor="gana_signature_purpose" numbered="true" toc="default">
<name>GNUnet Signature Purpose Registry</name>
<t>
GANA is requested to amend the "GNUnet Signature Purpose" registry
as follows:
</t>
<figure anchor="figure_purposenums" title="The Signature Purpose Registry Entries.">
<artwork name="" type="" align="left" alt=""><![CDATA[
Purpose | Name | References | Description
--------+-----------------+------------+---------------
6 DHT PATH Element [This.I-D] DHT message routing data
7 HELLO Payload [This.I-D] Peer contact information
]]></artwork>
</figure>
</section>
<section anchor="gana_message_type" numbered="true" toc="default">
<name>GNUnet Message Type Registry</name>
<t>
GANA is requested to amend the "GNUnet Message Type" registry
as follows:
</t>
<figure anchor="figure_messagetypeenums" title="The Message Type Registry Entries.">
<artwork name="" type="" align="left" alt=""><![CDATA[
Type | Name | References | Description
--------+-----------------+------------+---------------
146 DHT PUT [This.I-D] Store information in DHT
147 DHT GET [This.I-D] Request information from DHT
148 DHT RESULT [This.I-D] Return information from DHT
157 HELLO Message [This.I-D] Peer contact information
]]></artwork>
</figure>
</section>
</section>
<!-- gana -->
<section anchor="testvectors">
<name>Test Vectors</name>
</section>
</middle>
<back>
<references>
<name>Normative References</name>
&RFC2119;
&RFC3629;
&RFC3986;
&RFC4634;
&RFC5234;
&RFC6940;
&RFC8126;
&RFC8174;
&I-D.schanzen-gns;
<reference anchor="ed25519" target="http://link.springer.com/chapter/10.1007/978-3-642-23951-9_9"><front><title>High-Speed High-Security Signatures</title><author initials="D." surname="Bernstein" fullname="Daniel Bernstein"><organization>University of Illinois at Chicago</organization></author><author initials="N." surname="Duif" fullname="Niels Duif"><organization>Technische Universiteit Eindhoven</organization></author><author initials="T." surname="Lange" fullname="Tanja Lange"><organization>Technische Universiteit Eindhoven</organization></author><author initials="P." surname="Schwabe" fullname="Peter Schwabe"><organization>National Taiwan University</organization></author><author initials="B." surname="Yang" fullname="Bo-Yin Yang"><organization>Academia Sinica</organization></author><date year="2011"/></front></reference>
<reference anchor="GANA" target="https://gana.gnunet.org/"><front><title>GNUnet Assigned Numbers Authority (GANA)</title><author><organization>GNUnet e.V.</organization></author><date month="April" year="2020"/></front></reference>
</references>
<references>
<name>Informative References</name>
<reference anchor="R5N" target="https://doi.org/10.1109/ICNSS.2011.6060022">
<front>
<title>R5N: Randomized recursive routing for restricted-route networks</title>
<author initials="N. S." surname="Evans" fullname="Nathan S. Evans">
<organization>Technische Universität München</organization>
</author>
<author initials="C." surname="Grothoff" fullname="Christian Grothoff">
<organization>Technische Universität München</organization>
</author>
<date year="2011"/>
</front>
</reference>
<reference anchor="Kademlia" target="http://css.csail.mit.edu/6.824/2014/papers/kademlia.pdf">
<front>
<title>Kademlia: A peer-to-peer information system based on the xor metric.</title>
<author initials="P." surname="Maymounkov" fullname="Petar Maymounkov">
</author>
<author initials="D." surname="Mazieres" fullname="David Mazieres">
</author>
<date year="2002"/>
</front>
</reference>
<reference anchor="cadet" target="https://doi.org/10.1109/MedHocNet.2014.6849107">
<front>
<title>CADET: Confidential ad-hoc decentralized end-to-end transport</title>
<author initials="B." surname="Polot" fullname="Bartlomiej Polot">
<organization>Technische Universität München</organization>
</author>
<author initials="C." surname="Grothoff" fullname="Christian Grothoff">
<organization>Technische Universität München</organization>
</author>
<date year="2014"/>
</front>
</reference>
</references>
<section anchor="bloom_filters" numbered="true" toc="default">
<name>Bloom filters in R<sup>5</sup>N</name>
<t>
R<sup>5</sup>N uses Bloom filters in several places. This section
gives some general background on Bloom filters and defines functions
on this data structure shared by the various use-cases in R<sup>5</sup>N.
</t>
<t>
A Bloom filter (BF) is a space-efficient probabilistic datastructure
to test if an element is part of a set of elements.
Elements are identified by an element ID.
Since a BF is a probabilistic datastructure, it is possible to have
false-positives: when asked if an element is in the set, the answer from
a BF is either "no" or "maybe".
</t>
<t>
Bloom filters are defined as a string of <tt>L</tt> bits called "buckets".
The buckets are initially always empty, meaning that the bits are set to
zero.
There are two functions which can be invoked on the Bloom filter "bf":
BF-SET(bf, e) and BF-TEST(bf, e) where "e" is an element that is to
be added to the Bloom filter or queried against the set.
</t>
<t>
A mapping function M is used to map each ID of each element from the set to a
subset of k buckets.
In the original proposal by Bloom, M is non-injective and can thus map the same
element multiple times to the same bucket.
The type of the mapping function can thus be described by the following
mathematical notation:
</t>
<figure anchor="figure_bf_func" title="Bloom filter mapping function.">
<artwork><![CDATA[
------------------------------------
# M: E->B^k
------------------------------------
# L = Number of buckets
# B = 0,1,2,3,4,...L-1 (the buckets)
# k = Number of buckets per element
# E = Set of elements
------------------------------------
Example: L=256, k=3
M('element-data') = {4,6,255}
]]>
</artwork>
</figure>
<t>
When adding an element to the Bloom filter <tt>bf</tt> using
<tt>BF-SET(bf,e)</tt>, each integer <tt>n</tt> of the mapping
<tt>M(e)</tt> is interpreted as a bit offset <tt>n mod L</tt> within
<tt>bf</tt> and set to 1.
</t>
<t>
When testing if an element may be in the Bloom filter <tt>bf</tt> using
<tt>BF-TEST(bf,e)</tt>, each bit offset <tt>n mod L</tt> within
<tt>bf</tt> <bcp14>MUST</bcp14> have been set to 1.
Otherwise, the element is not considered to be in the Bloom filter.
</t>
</section>
<section anchor="overlay" numbered="true" toc="default">
<name>Overlay Operations</name>
<t>
An implementation of this specification commonly exposes the two overlay
operations "GET" and "PUT".
The following are non-normative examples of APIs for those operations.
Their behaviour is described prosaically in order to give implementers a fuller
picture of the protocol.
</t>
<section>
<name>GET operation</name>
<t>
A basic GET operation interface may be exposed as:
</t>
<t>
<tt>GET(Query-Key, Block-Type) -> Results as List</tt>
</t>
<t>
The procedure typically takes at least two arguments to initiate a lookup:
</t>
<dl>
<dt><tt>QueryKey</tt>:</dt>
<dd>
is the 512-bit key to look for in the DHT.
</dd>
<dt>Block-Type:</dt>
<dd>
is the type of block to look for, possibly "any".
</dd>
</dl>
<t>
The GET procedure may allow a set of optional parameters in order to
control or modify the query:
</t>
<dl>
<dt>Replication-Level:</dt>
<dd>
is an integer which controls how many nearest peers the request
should reach.
</dd>
<dt>Flags:</dt>
<dd>
is a 16-bit vector which indicates certain
processing requirements for messages.
Any combination of flags as defined in <xref target="route_flags"/>
may be specified.
</dd>
<dt>eXtended-Query (XQuery):</dt>
<dd>
is medatadata which may be
required depending on the respective <tt>Block-Type</tt>.
A <tt>Block-Type</tt> must define if the <tt>XQuery</tt> can or must
be used and what the specific format of its contents should be.
Extended queries are in general used to implement domain-specific filters.
These might be particularly useful in combination with FindApproximate
to add a well-defined filter by an application-specific distance.
Regardless, the DHT does not define any particular semantics for an XQuery.
See also <xref target="blockstorage"/>.
</dd>
<dt>Result-Filter:</dt>
<dd>
is data for a <tt>Block-type</tt>-specific filter
which allows applications to
indicate results which are
not relevant anymore to the
caller (see <xref target="result_filter"/>).
</dd>
</dl>
<t>
The GET procedure should be implemented as an asynchronous
operation that returns individual results as they are found
in the DHT. It should terminate only once the application
explicitly cancels the operation. A single result commonly
consists of:</t>
<dl>
<dt>Block-Type:</dt>
<dd>
is the desired type of block in the result.
</dd>
<dt>Block-Data:</dt>
<dd>
is the application-specific block payload. Contents are specific to the <tt>Block-Type</tt>.
</dd>
<dt>Block-Expiration:</dt>
<dd>
is the expiration time of the block. After this time, the result should no
longer be used.
</dd>
<dt>Key:</dt>
<dd>
is the key under which the block was stored. This may be different from the
key that was queried if the flag FindApproximate was set.
</dd>
<dt>GET-Path:</dt>
<dd>
is a signed path of the IDs of peers which the query
traversed through the network. The DHT will try to make
the path available if the <tt>RecordRoute</tt> flag was set by
the application calling the PUT procedure. The reported
path may have been silently truncated from the beginning.
</dd>
<dt>PUT-Path:</dt>
<dd>
is a signed path of the IDs of peers which the
result message traversed. The DHT will try to make the
path available if the <tt>RecordRoute</tt> flag was set for the GET procedure.
The reported path may have been silently truncated from the beginning.
As the block was cached by the node at the end of this
path, this path is more likely to be stale compared to the
<tt>GET-Path</tt>.
</dd>
</dl>
</section>
<section>
<name>PUT operation</name>
<t>
A PUT operation interface may be exposed as:
</t>
<t>
<tt>PUT(Key, Block-Type, Block-Expiration, Block-Data)</tt>
</t>
<t>
The procedure typically takes at least four parameters:
</t>
<dl>
<dt>Key:</dt>
<dd>is the key under which to store the block.</dd>
<dt>Block-Type:</dt>
<dd>is the type of the block to store.</dd>
<dt>Block-Expiration:</dt>
<dd>specifies when the block should expire.</dd>
<dt>Block-Data:</dt>
<dd>is the application-specific payload of the block to store.</dd>
</dl>
<t>
The PUT procedure may allow a set of optional parameters in order to
control or modify the query:
</t>
<dl>
<dt>Replication-Level:</dt>
<dd>
is an integer which controls how many nearest peers the request
should reach.
</dd>
<dt>Flags:</dt>
<dd>
is a bit-vector which indicates certain
processing requirements for messages.
Any combination of flags as defined in <xref target="route_flags"/>
may be specified.
</dd>
</dl>
<t>
The PUT procedure does not necessarily yield any information.
</t>
</section>
</section>
<section anchor="hello_url">
<name>HELLO URLs</name>
<t>
The general format of a HELLO URL uses "gnunet://"
as the scheme, followed by "hello/" for the name
of the GNUnet subsystem, followed by "/"-separated values
with the GNS Base32 encoding (<xref target="I-D.schanzen-gns"/>) of
the <tt>Peer ID</tt>, a Base32-encoded EdDSA signature, and an expiration
time in seconds since the UNIX Epoch in decimal format.
After this a "?" begins a list of key-value pairs where the key
is the URI scheme of one of the peer's addresses and the value
is the URL-escaped payload of the address URI without the "://".
</t>
<t>
For example, consider the following URL:
</t>
<figure>
<artwork type="abnf"><![CDATA[
gnunet://hello/RH1M20EPK834M6MHZ72\
G3CMBSF3ECKNY4W0T9VAQP9Z7SZEM6Y3G/\
NGRTAH6RA04X467CGCH7M7CEXR5F9CV5HT\
ZFK0G9BWETY3CCE2QWGVT4WA7JN5M9HMWG\
60A00R71F1PJP8N5628EKGHHBAGA7M8JW3\
0/1647134480?udp=127.0.0.1%3A2086
FIXME: signature is invalid, should
maybe generate proper test vector.
]]>
</artwork>
</figure>
<t>
It specifies that the peer with the ID "RH1M...6Y3G"
is reachable via "udp" at 127.0.0.1 on port 2086 until
1647134480 seconds after the Epoch. Note that "udp"
here is underspecified and just used as a simple example.
<!-- FIXME: Must be supported by which underlay?
This does not make sense. I may be able to generate
addr-names that my underlay supports, but there is not
way to guarantee that all underlays support it. -->
In practice, the key (addr-name)
<bcp14>MUST</bcp14> refer to a scheme supported by a
DHT Underlay.
</t>
<t>
The general syntax of HELLO URLs specified using
Augmented Backus-Naur Form (ABNF) of <xref target="RFC5234"/> is:
</t>
<figure>
<artwork type="abnf"><![CDATA[
hello-URL = "gnunet://hello/" meta [ "?" addrs ]
meta = pid "/" sig "/" exp
pid = *bchar
sig = *bchar
exp = *DIGIT
addrs = addr *( "&" addr )
addr = addr-name "=" addr-value
addr-name = scheme
addr-value = *pchar
bchar = *(ALPHA / DIGIT)
]]>
</artwork>
</figure>
<t>
'scheme' is defined in <xref target="RFC3986" /> in Section 3.1.
'pchar' is defined in <xref target="RFC3986" />, Appendix A.
</t>
</section>
</back>
</rfc>
|