Add a way to discover on which protocols Order-By applies
Remove references to SQL (except in implementation notes)
Specify that order-by operate on the whole item set and inside a RSM result set
Explicitly says that creation and modification dates are set by Pubsub service itself
Specify that Clark notation should be used for extensions
Add a full example with Pubsub and RSM
Add hint for SQL based implementations
removed XEP-0060 and XEP-0313 as dependencies, they are mentioned as use cases, but are not mandatory
better wording following feedback
Namespace bump
Editorial language fixes
First draft.
Publish-Subscribe (XEP-0060)
This specification allows to explicitly change business logic to retrieve the items in a different order.
In XEP-0060, there is no such thing as "updated item". This XEP changes the business logic as follow:
Juliet wants to retrieve plays of her favorite writer, William Shakespeare. She wants to retrieve the 3 most recent ones by date of creation.
To do so, her client do a regular Pubsub request, but adds the <order> element as a children of the <pubsub> element with the "urn:xmpp:order-by:1" namespace, a by attribute equal to creation and a desc attribute equal to true.
The Pubsub service then returns the 3 most recently created plays, first one being the most recent.
Juliet realizes that there is a spelling mistake, it's "Winter's Tale" and not "Wintter's Tale". She fixes it by overwritting the item:
To check that everything is alright, she requests again the last 3 items, but this time by date of modification. To do so, the client proceeds the same way as for date of creation, except that it uses the value modification for the by attribute.
The Pubsub service returns again the 3 plays but the "Winter Tales" item has been overwritten recently, while the 2 others have never been overwritten, so it returns the items in the following order, with the most recently modified item on top:
With Message Archive Management (XEP-0313)
This way, filters can be used with a specific ordering.
By default, ordering MUST be done in ascending order. This can be reversed by using the desc boolean attribute, which MAY have a value of either true or 1.
This section provides a full example of using Order-By with Pubsub and RSM. For readability, we'll use a node with 4 items that will have following IDs (in order of their creation) A, B, C and D.
Items C has been overwritten after D creation, and item A has been overwritten even later. Thus, when ascending creation order is requested, items are in order A, B, C, D. When ascending modification order is requested, items are in order B, D, C, A.
Let's see how this work when Juliet wants to retrieve all items in ascending modification order with RSM using a page size of 2 items:
Now Juliet wants to get the second and last page to complete her collection. She does this as usual with RSM, by using the value advertised in <last> element in a <after> element.
NOTE: in this example the value used in <last> element is the item ID, but as specified in Result Set Management (XEP-0059)
Juliets wonders which are the 2 last items created. To discover this, she request again the node, but this time with a creation order field, and in descending order:
Now she knows that last created item is D, and the one created before is C.
Please note that items are in descending order in the whole result set but also inside the RSM page (thus the first item here is D), and that in this order, this request returns the first page, so index is 0 here.
If Juliet wanted to retrieve the second page of items by descending order of creation, she would do like this:
This specification can be extended by further XEPs, proposing other kind of ordering in the 'by' attribute (e.g. ordering by filename for a file sharing service). But this is beyond the scope of this XEP, and a client should not assume that other ordering than "creation" and "modification" are available without further negotiation. Any new ordering specified in a other XEP SHOULD use the Clark notation to avoid any collision (i.e.: {some_namespace}some_ordering).
It is important to note the following points:
If a server supports the "order by" protocol, it MUST advertize it including the "urn:xmpp:order-by:1" discovery feature (see Protocol Namespaces regarding issuance of one or more permanent namespaces) in response to a Service Discovery (XEP-0030)
In addition to the general feature support, an entity MUST indicated on which protocols Order-By can be used, by using the notation urn:xmpp:order-by:1@other_namespace, i.e. a concatenation of:
So if Order-By is implemented for Publish-Subscribe (XEP-0060)
In the following example, the server example.org advertizes Order-By support, and indicates that it is implemented for Pubsub and MAM:
Several ordering elements may be used, this allows to solve next levels of ordering in case of equality. In this case, the first ordering (i.e. the top most <order> element) is the main one, the second <order> element is used in case of equality, then the next one if a new equality happens and so on.
In case of equality, if no new <order> element is specified, the item order is not guaranteed and is up to the implementation (the implementation MUST keep this order consistent across requests though).
It may be difficult to find a correct value for <first> and <last> elements of RSM. Indeed, internal ID of items can't be suited for all orderings. For Pubsub service using a SQL database as backend, item ID (XMPP or internal) could be used with a window function such as row_number (supported by major database engines such as PostgreSQL, MariaDB/MySQL or SQLite) over the requested ordering. For instance, on a hypothetical table where items are requested by ascending creation then modification dates after the value ABC (which correspond to XMPP item ID in our case), a request similar to this could be used:
In this example, row_number is decreased by 1 to match RSM index (row_number starts at 1 while RSM index starts at 0), thus the item_index column can be used directly to fill RSM metadata. A Common Table Expression has been used for better readability.
This document introduces no additional security considerations above and beyond those defined in the documents on which it depends.
This document requires no interaction with the Internet Assigned Numbers Authority (IANA)
This specification defines the following XML namespace:
If the protocol defined in this specification undergoes a revision that is not fully backwards-compatible with an older version, the XMPP Registrar shall increment the protocol version number found at the end of the XML namespaces defined herein, as described in Section 4 of XEP-0053.
]]>
Thanks to Philipp Hörist, Evgeny xramtsov, Jonas Schäfer¸ and Holger Weiß for their feedback.