I am providing access to a REST endpoint where some aggregates are computed.
Let's say the model is as follows:
Purchase:
amount: amount spent
group: one of ELECTRONICS, FOOD, FURNITURE, ...
date: purchase date
Now I want to offer a timeline of purchases, aggregated on a weekly basis, and partitioned by group. This will produce the following data:
{
"ELECTRONICS": [{"week": "2018WK1", "amount": 1000.0}, ...],
"FOOD": [{"week": "2018WK1", "amount": 2000.0}, ...],
"FURNITURE": [{"week": "2018WK1", "amount": 3000.0}, ...],
...
}
Some things to note:
the number of groups is not known in advance (it depends on the stored data)
since this is computed data, I would like to return the whole thing in one single request, instead of having to do the aggregates for each of the groups in separate requests
The URL for the request would be something like: /api/weekly_purchases/2018
How can I offer these kind of resources in a REST API?
Return several resources in REST call
How would you do it as a web page?
Somewhere on your web site would be a link, with text "this week's summary" (or whatever the appropriate concept is in the language of your domain). If a user clicked that link, the browser would do a GET on one URL, which would go to the server, aggregate all of the data together, and return the result.
So do that?
REST doesn't care about the spelling of the URI (the browser doesn't try to interpret the URL, except in very shallow generic ways), so /api/weekly_purchases/2018 is fine.
The trick is recognizing that a report summarizing purchases in the current fiscal year, broken out by week, is a resource. It may have data in it that duplicates the information in other resources, even data in many other resources, but it is still a resource itself.
As already mentioned in my initial comment or by VoiceOfUnreason the same techniques that apply to the browser-based Web apply to any interaction model used by applications that follow the REST architecture principles. As also mentioned by VoiceOfUnreason a client would initially request some state returned from the entry-point, i.e. https://api.acme.com that will return a collection of links a client can use to progress its task. In order for the client to determine which URL to invoke the response should give the URI a meaningful name (link-relation name). IANA maintains a list of already specified link-relation names you should use if possible or define your own one in further standards. According to Fielding the specification of media-types and link relations is one of the most important things to do if developing a RESTful architecture.
For simplicity I use a simplified HAL-JSON syntax throughout the example.
{
...
"_links": {
"self": {
"href": "https://api.acme.com"
},
...
"archives": {
"href": "https://api.acme.com/weekly_purchases"
},
...
}
}
According to the HTML 5 spec archives
indicates that the referenced document describes a collection of records, documents, or other materials of historical interest
The link relation name therefore describes the intent of the URI which client can use if interested in retrieve a collection of historical entries. The client does not really have to know the exact URI as he will learn it by simply following the link relation's target href element. This allows the server to change its internal URI structure anytime it has to without actually breaking clients.
On following the archives target URI a client will not really know yet how the actual data has to be retrieved as the URI and the link relation name are to generic. But the server will guide the client through its task. A response on the invocation of the abovementioned target URI might return the following content:
{
"year": [
"2018": {
"_links": {
"chapter": {
"href": "https://api.acme.com/weekly_purchases/2018"
}
}
},
"2017": {
"_links": {
"chapter": {
"href": "https://api.acme.com/weekly_purchases/2017"
}
}
},
...
"2014": {
"_links": {
"chapter": {
"href": "https://api.acme.com/weekly_purchases/2014"
}
}
}
],
"_links": {
"self": {
"href": "https://api.acme.com/weekly_purchases"
},
"first": {
"href": "https://api.acme.com/weekly_purchases"
},
"next": {
"href": "https://api.acme.com/weekly_purchases?p=1"
},
"last": {
"href": "https://api.acme.com/weekly_purchases?p=3"
},
"current": {
"href": "https://api.acme.com/weekly_purchases"
}
}
}
This response basically only teaches the client that there are multiple years available to choose from and the client has to decide which year s/he is interested in an invoke that URI to proceed its task. The next, last and first link relation indicate that there are multiple pages available as only the 5 years per page are returned. The current link relation name will always point to the most recent entry in the collection, which is the initial page (or first page) of the collection-resource. Note further how multiple different link-relation names may point to the same URI. Sometimes it isn't really clear which link relation names to use as their semantics partly overlap. This is just an example on what can be done with link relation names.
A client can now further drill down to the purchases done in 2018 by following the chapter link for 2018. A response on invoking that URI may now look like this:
{
"purchase": [
"W1": {
"sum": 1263.59,
"currency": "Euro",
"_links": {
"about": {
"href": "https://api.acme.com/weekly_purchases/2018/1"
}
}
},
"W2": {
"sum": 569.32,
"currency": "Euro",
"_links": {
"about": {
"href": "https://api.acme.com/weekly_purchases/2018/2"
}
}
},
...
"W48": {
"sum": 72.98,
"currency": "Euro",
"_links": {
"about": {
"href": "https://api.acme.com/weekly_purchases/2018/48"
}
}
},
"current": {
"sum": 72.98,
"currency": "Euro",
"_links": {
"about": {
"href": "https://api.acme.com/weekly_purchases/2018/48"
}
}
}
],
"_links": {
"index": {
"href": "https://api.acme.com/weekly_purchases"
},
"self": {
"href": "https://api.acme.com/weekly_purchases/2018"
},
"current": {
"href": "https://api.acme.com/weekly_purchases/2018"
},
"prev": {
"href": "https://api.acme.com/weekly_purchases/2017"
},
"prev-archive": {
"href": "https://api.acme.com/weekly_purchases/2017"
},
"first": {
"href": "https://api.acme.com/weekly_purchases/2000"
}
}
}
You could either add content here to the weekly summary or hide it down the road by following the about link only if clients are really interested in such details.
Note further: As weekly_purchases is just a string without meaning to the client it does not really know what it means. You could therefore also rename it to purchase-archive or something like that and introduce a further choice to the client and let the client determine whether it wants a weekly, monthly or total summary of that year.
REST is about providing choices to a client and teach it what the actual choices are intended for. One of the aims the RESTful architecture tries to solve is the strict coupling between clients and servers which prevent the latter one from evolving freely and the former ones to break if the latter one changes unexpectedly. This decoupling only works if certain standards are used to increase the likelihood for interoperability. Usually out-of-band information (pre-existing knowledge about the API and how to interact with it) is leading to a coupling. Even Fielding stated that some prior knowledge is needed though but not encoded directly into the application but on reusing certain standards like well-defined and stable media-types and link-relation names.
Related
I have a very simple example using Spring Data Rest and JPA which exposes a person resources. When launching the application it works as expected and I can POST and GET instances of the resource.
When sending a GET against /person I get the following response:
"_embedded": {
"person": [
{
"firstName": "FN",
"lastName": "LN",
"_links": {
"self": {
"href": "http://localhost:9090/person/1"
},
"person": {
"href": "http://localhost:9090/person/1"
},
"address": {
"href": "http://localhost:9090/person/1/address"
}
}
}
]
},
"_links": {
"self": {
"href": "http://localhost:9090/person{?page,size,sort}",
"templated": true
},
"profile": {
"href": "http://localhost:9090/profile/person"
},
"search": {
"href": "http://localhost:9090/person/search"
}
},
"page": {
"size": 20,
"totalElements": 1,
"totalPages": 1,
"number": 0
}
}
As you can see the person resource has a firstName attribute with a value of FN.
My question is should the following GET query work out of the box?
/person?firstName=FN
or is this something that needs to be implemented with a custom search method?
Needless to say it isn't working for me but I'm seeing conflicting information as to if it is supported out of the box.
Thanks in advance,
My question is should the following GET query work out of the box?
No. You will get an error like
{
"cause": {
"cause": null,
"message": "For input string: \"findByName\""
},
"message": "Failed to convert from type [java.lang.String] to type [java.lang.Long] for value 'findByName'; nested exception is java.lang.NumberFormatException: For input string: \"findByName\""
}
Spring Data Rest has the following URL structure.
/{pluralEntityName}/{primaryKey} e.g when the entity name is Person and the primary key is Long it will be /persons/1
We see this error message because Spring Date Rest tries to convert the second argument after the entity name to the primary key of that entity. In my Person entity, the primary key is Long, so It tries to convert findByName to Long and fails.
or is this something that needs to be implemented with a custom search method?
Yes, if you want to do a search on the repository you need to write a method in the repository following Spring Data JPA's method name conventions then Spring Data JPA will automatically convert this method to an SQL query and Spring Data Rest will automatically expose this method as an endpoint.
e.g If you write a method like:
List findByNameContains(String name);
This method will be exposed with Spring Data Rest and you will be able to access it from the following endpoint:
http://localhost:8080/persons/search/findByNameContains?name=Mahsum
Btw, you can see all available search method by visiting http://localhost:8080/persons/search
I have a REST URI for a list of resources, something like:
http://foo.com/group/users
Each of these users has a sequence number and I want to expose a way to renumber those values for all the users in the collection and make this change available to everyone who accesses the list. Since this is an action on the collection as a whole, I'm not sure how to accomplish this.
I can envision a URL like http://foo.com/group/users?sequence=normalize but neither a PUT nor a POST really makes sense for the whole list, unless I submit the whole collection with the new numbers as the message data.
How can I make an update to an entire collection like this in a RESTful way without having to resend all the updated resources in the collection?
After the raffian's comment on my initial response, I reworked my answer to be more RESTful...
Use the method PATCH
This method is typically designed to update partially the state of a resource. In the case of a list resource, we could send a list with only the elements to update and the identifiers of elements in the list. The following request would be:
PATCH /group/users
[
{ "id": "userId1", "sequence": "newSequenceNumber1" },
{ "id": "userId2", "sequence": "newSequenceNumber2" },
(...)
]
Use the method POST on the list resource
This method is commonly used to add an element in the list managed by the resource. So if you want to leverage it for this action, we need to pass within the request an hint regarding the action to execute. We have the choice to add this either in a dedicated header or within the payload.
With the header approach, you will have something like that:
POST /group/users
X-Action: renumbering
[
{ "id": "userId1", "sequence": "newSequenceNumber1" },
{ "id": "userId2", "sequence": "newSequenceNumber2" },
(...)
]
With the payload approach, you will have something like that:
POST /group/users
{
"action": "renumbering",
"list": {
[
{ "id": "userId1", "sequence": "newSequenceNumber1" },
{ "id": "userId2", "sequence": "newSequenceNumber2" },
(...)
]
}
}
Hope it helps you,
Thierry
Semantically speaking, the HTTP PATCH method is the right way to go. This is also described in the currently chosen answer.
PATCH /group/users
[
{ "id": "userId1", "sequence": "newSequenceNumber1" },
{ "id": "userId2", "sequence": "newSequenceNumber2" },
...
]
However, the second method described in the chosen answer is not restful, because you invented new verbs inside a POST request. This is SOAP, not REST.
You can use both PATCH and POST on the URIs. I'd use PATCH if I were you. It's the best solution for bulk updates.
I’m working on a Spring Data Rest project and would like a way to effectively ‘split’ my API off two base roots / endpoints. For example, consider the root below:
{
"_links": {
"orders": {
"href": "http://localhost:8080/orders{?page,size,sort,projection}",
"templated": true
},
"customers": {
"href": "http://localhost:8080/customers{?page,size,sort,projection}",
"templated": true
},
"profiles": {
"href": "http://localhost:8080/profiles{?page,size,sort,projection}",
"templated": true
}
}
}
Now assume that I don’t really want my API consumers to mess with the profiles resource (they still technically can but its unsupported) but I still need it as part of my Rest API as my UI uses it. So what I really want is to base some of my APIs under “public/” and others under “internal/”
I.e.
{
"_links": {
"orders": {
"href": "http://localhost:8080/public/orders{?page,size,sort,projection}",
"templated": true
},
"customers": {
"href": "http://localhost:8080/public/customers{?page,size,sort,projection}",
"templated": true
},
"profiles": {
"href": "http://localhost:8080/internal/profiles{?page,size,sort,projection}",
"templated": true
}
}
}
The problem I have is that this doesn’t work as I believe Spring Data assumes the second level off my base (/) is the internal sub-resource. Is there any way I can achieve the above using Spring Data Rest?
Typically if you want to customize the response you will want to utilize the Spring HATEOAS project.
Say I want to model the relationship of IP Address and ports in a RESTful manner. An IP address contains many ports, while a port can only have one IP Address. Additionally an IP Address is unique in this domain by it's address, while a port is unique by it's port number within it's IP address.
What are the potential strategies to model this? I can think of three that all have advantages and disadvantages.
Separate Resources
Maintain completely separate resources of the two and represent the relationship through the URL structure and allow discovery through links.
GET .../ipaddress/{address}/
{
"ipaddress": "1.2.3.4",
"ipversion": "v4",
"links": [
{
"rel": "port"
"link": "/ipaddress/1.2.3.4/ports/1234"
},
{
"rel": "port"
"link": "/ipaddress/1.2.3.4/ports/1235"
},
{
"rel": "port"
"link": "/ipaddress/1.2.3.4/ports/1236"
}
]
}
GET .../ipaddress/{address}/port/{portnumber}
{
"port":1234,
"protocol":"unknown"
...other data here
}
Advantages
Logical representation of the resources that maps to the problem precisely.
Allows HTTP methods such as GET/POST/PUT/DELETE to be applied to not only an IP Address, but to a port within an IP Address.
Disadvantages
To retrieve all ports for an IP Address, I have to issue N + 1 requests. One request for the IP address to obtain links to all the ports and then N subsequent requests for each port resource.
Embedded Resources
Embed all port objects within the IP Address object.
GET .../ipaddress/{id}/
{
"ipaddress": "1.2.3.4",
"ipversion": "v4",
"ports": [
{
"port": 1234,
"protocol": "unknown",
otherdata....
},
{
"port": 1235,
"protocol": "unknown"
otherdata....
},
{
"port": 1234,
"protocol": "unknown"
otherdata....
}
]
}
Advantages
Can retrieve all port objects for an IP address with a single request.
Disadvantages
Can no longer issue a separate request to manipulate a port object. For example, how would you delete a single port from an IPAddress? I could issue a PUT request replacing the entire IP Address with the port removed or do some kind of delta but that seems clumsy.
Could potentially lead to a large set of data, say for example if the port object contained additional information about the application that opened it.
Hybrid Approach
Create separate resources, but within the link object in ipaddress, embed some data from that resource, in this case the port number.
GET .../ipaddress/{id}
{
"ipaddress": "1.2.3.4",
"ipversion": "v4",
"links": [
{
"port": 1234,
"rel": "port",
"link": "/ipaddress/1.2.3.4/ports/3"
},
{
"port": 1235,
"rel": "port",
"link": "/ipaddress/1.2.3.4/ports/3"
},
{
"port": 1236,
"rel": "port",
"link": "/ipaddress/1.2.3.4/ports/3"
}
] }
GET .../ipaddress/{id}/port/{portnumber}
Remains the same as the first example.
Advantages
Allows for some data to be obtained in a single request.
Allows me to issue GET/PUT/POST/DEL requires to a URI that represents a port.
Disadvantages
I have explicitly chosen what data to represent in the "link summary". If the client wants to know a list of all protocols for all ports they still have to perform N+1 requests.
I believe things like JSON+HAL use the hybrid approach, in that they have embedded links to other resources that contain data. But does the client have a choice of what is displayed?
If I don't use any embedded data I end up with too many requests and if I embed everything I end up without the fine grained control of being able to manipulate sub-resources. Are there any other modelling alternatives?
I know this is quite a trivial example as port doesn't have a lot of data in it. But this could be applied to any two objects with a dependent relationship.
Thanks
Use HAL+JSON embedded resources with their own link and a field filter in the queryString.
GET /ipaddress/1.2.3.4?fields="ipaddress,inversion,ports:[port,protocol,otherdata]"
{
"ipaddress": "1.2.3.4",
"ipversion": "v4",
"_embedded": {
"ports": {
"items": [
{
"port": 1234,
"protocol": "unknown",
otherdata....
"_links": {
"self": {
"href": "/ipaddress/1.2.3.4/ports/1234"
}
}
},
{
"port": 1235,
"protocol": "unknown"
otherdata....
"_links": {
"self": {
"href": "/ipaddress/1.2.3.4/ports/1235"
}
}
},
...
],
"_links": {
"self": {
"href": "/ipaddress/1.2.3.4/ports"
}
}
}
},
"_links": {
"self": {
"href": "/ipaddress/1.2.3.4"
}
}
}
Btw. the current implementation depends only on your needs...
Yes the client has a choice, you can add metadata (like link relations, microdata, microformats etc...) to your media format, and you can ignore parts of the response. You can send back complex link with the results which can contain input fields and you can setup with them link parameters, like field filters in the queryString, etc... Yes, this will make implementation harder, you have to work with it a lot, but I guess you don't expect a complete code without any effort...
There are a lot of other media types, for example: JSON-LD, collection+json, siren, etc... you can check them if you want, maybe their approaches are better for you...
So I have a REST API in which I Have a resource in which other resources are linked to (related models, in programming point of view).
So how I am doing it right now is that whenever I request for a resource, related resources are referenced via URLs ('/related_data/related_data_id/').
However, I worry that, let's say there are 5 related resources to the one I'm retrieving, is that I would perform 5 GET requests. I am writing an iPhone client and I'm wondering if this is how to properly do it using REST (that I'm returning URLs). A sample JSON response is this:
{
"meta": {
"limit": 20,
"next": null,
"offset": 0,
"previous": null,
"total_count": 2
},
"objects": [
{
"away_team": "/api/team/3/",
"country": "/api/country/1/",
"event_date": "2011-08-16",
"event_time": "06:00:00",
"event_timezone": "GMT",
"home_team": "/api/team/4/",
"id": "1",
"level": "/api/level/4/",
"resource_uri": "/api/event/1/",
"tournament": "/api/tournament/1/"
},
{
"away_team": "/api/team/4/",
"country": "/api/country/1/",
"event_date": "2011-09-29",
"event_time": "12:00:00",
"event_timezone": "UTC",
"home_team": "/api/team/3/",
"id": "2",
"level": "/api/level/1/",
"resource_uri": "/api/event/2/",
"tournament": "/api/tournament/6/"
}
]
}
Is this a proper way of doing it in REST, considering that "each URI must map to a resource" and all those things?
I am using Django and django-tastypie
Thanks in advance!
Yes; that is proper if the related resources are updated independently. REST architectures depend on caching for performance, and therefore work best with resources which act as atomic entities (see more here). That way, you can update resource B and have its representation be fresh without having to update resource A. See this SO comment for more design details.