(I have appropriated
pozorvlak's beware the geek tag for a reason. I hope this is as accessible as his writing on technical subjects)
Why are you doing this complicated-sounding thing?
A little background: at an exciting startup project, we're developing two main components -- a mobile client (a smartphone app) and a server. Currently, we've got an iOS (iPhone / iPad) client out, in a very early preview stage (more of an alpha than a beta) and we're building the Android side to match.
One of the big things we use -- and mention lots when talking to people we want to impress -- is push notifications: that is, our server doesn't wait for the user to check for updates, but rather proactively lets them know when there's something for their attention.
That sounds cool. So how does it work?
At a high level, your system has to talk to Apple's (for iPhones / iPads) or Google's (for Android devices) system, and Apple/Google then forwards a message to the specific device you had in mind. So you need to:
- have the address of a particular device you want a message to go to,
- speak enough of the same protocol as Apple/Google to get your message across,
- have a way, on the device, of receiving a message and doing something with it
Thanks to a very experienced iPhone developer doing our iOS work, and an
excellent post from a(nother) startup, it wasn't too much of a trial to set things up with Apple. The application on the device, when it starts up, asks the device for its address, and then sends that over to the server. The server can then send Apple a message using a slightly esoteric raw socket binary protocol, using that address, and Apple will forward it on. Finally, the application can tell the device it wants to receive those types of messages, and the device will wake it up and give it the message when one comes in.
The way Android does things is a bit more involved. The application, when it starts up, has to ask the device to ask Google for an address, and be willing to receive a message back from Google with that address. Once it's received the address, it can send that over to the server. The server, meanwhile, has to ask Google to validate its account details, and get a token that allows it to talk to Google. Once it's done that and has a device's address, the server can use the token and the address to send a message to Google. Finally, the application can tell the device it wants to receive those types of messages, and specifically how it wants to receive them, and the device will wake it up and give it the message.
OK, that seems logical enough. What went wrong?
To be fair to Google, it's only a few more steps, there's no esoteric raw socket binary protocol you have to speak, and the messages they allow are more flexible and powerful than Apple's. But combine a slightly inexperienced Android developer with
sparse documentation and
misleading examples and you have a bit of a problem. In my case, you have me diving into the
operating system code to figure out why nothing works.
Ooh, interesting! Give me all the geeky details!
With an Android application, you have to tell the device about who you are and what you want to do in the application manifest, AndroidManifest.xml . Typically this is stuff like informing the device you need Internet access, or you want to be able to keep the phone from going to sleep (e.g. if you're a game). But if you want to receive push messages (or cloud-to-device messages, as Google calls them) you have to declare that in your manifest.
First of all, you have to tell the device that you have created a component that will service your messages, using a element, and inform the device of that component's name. Then you have to tell it how you expect it to service all received messages, and how to route yours correctly.
What's confusing is that, in order to make it all work, Google suggest using their prefabricated example code -- so you have to copy-and-paste their code into your project (which, as a developer, left a bad taste in my mouth from the start) and then intermingle references to their sample code and your own code. So your manifest should end up containing a section that looks like (of course, replace all references to the org.sample namespace with your own app's):
>!-- <<- THIS IS
*GOOGLE* CODE, COPIED IN FROM THE CHROME-TO-PHONE EXAMPLE -->
Get all of that right, and implement the right methods on your registration and receiver classes (again, following closely from the examples, and using the right senderId as I discuss below) and it should all work, and you can avoid the hours of ClassCastExceptions and silently failing registration Intents that I waded through.
On the server side of things, while the documentation is sparse, at least it's not misleading. Essentially, your server needs to have a Google account. This Google account needs to be
signed up to the service, and it also needs to be used as the senderId in your app code above.
What your server does in the meantime is request a ClientLogin authentication token from Google. This has to be saved somewhere, and while
an FAQ says ClientLogins are valid for up to two weeks, it also says that validity varies by service. So, in short, your server has to be prepared for Google to arbitrarily expire your token, and has to be able to fetch a new one.
Finally, to send messages to your newly enabled Android application, your server sends HTTPS POST requests to Google using the registration ID your application got back from Google. (How to get the registration ID from the application to the server is left as an exercise for the reader ;-)). And that process, happily, is
quite well documented.
All right then. What's next?
If anyone's read this far, thank you -- I needed to get that rant off my chest. Since Google do a huge amount to promote their developer-friendliness as a company, and their products do typically have very good documentation (the
Android dev site is full of reference material), I was expecting more from them.
But I (and the startup project) will be pushing ahead! Funding is on the horizon, and we'd like to ramp up development. We are looking for smartphone and server developers -- so if you or anyone you know is interested in fast-moving, exciting, and possibly even paid work building iOS / Android apps and / or Python servers, please
let me know!
Posted originally at
http://necaris.dreamwidth.org/122198.html (
comments). Please
comment there using
OpenID (your LJ will work as identity URL).