13. Sequence Classification
Written by Chris LaPollo

Heads up... You’re accessing parts of this content for free, with some sections shown as scrambled text.

Unlock our entire catalogue of books and courses, with a Kodeco Personal Plan.
Unlock now

If you’ve followed along with the last couple chapters, you’ve learned some things about how working with sequences differs from other types of data, and you got some practice collecting and cleaning datasets. You also trained a neural network to recognize user gestures from iPhone sensor data. Now you’ll use your trained model in a game where players have just a few seconds to perform an activity announced by the app. When you’ve finished, you’ll have learned how to feed data from your device into your model to classify user activity.

This chapter picks up where the last one ended — just after you added your classification model to the GestureIt project. If you didn’t go through the previous chapter and train your own model, don’t fret! You can always use the GestureIt starter project found in the chapter resources. Either way, once you have the project open in Xcode, you’re ready to go!

Classifying human activity in your app

You trained a model and added it to the GestureIt project in the last chapter, and you learned a bit about how that model works. Now take a quick look through the project to see what else is there. The project’s Info.plist file already includes the keys necessary to use Core Motion, explained earlier when you built the GestureDataRecorder project.

GestureIt’s interface (not shown here) is even simpler than GestureDataRecorder’s — it’s just two buttons: Play and Instructions. Choosing Instructions shows videos of each gesture, and Play starts a game.

While playing, the game speaks out gestures for the player to make, awarding one point for each correctly recognized gesture. The game ends when the app recognizes an incorrect gesture or if the player takes too long.

The project already includes the necessary gameplay logic, but if you play it now you’ll always run out of time before scoring any points. If you want it to recognize what the player is doing, you’ll need to wire up its brain.

All the code you write for the rest of this chapter goes in GameViewController.swift, so open that file in Xcode to get started.

This file already imports the Core Motion framework and includes all the necessary code to use it. Its implementations of enableMotionUpdates and disableMotionUpdates are almost identical to what you wrote in the GestureDataRecorder project. The differences are minor and you should have no problem understanding them. As was the case with that project, this file contains a method named process(motionData:) that the app calls whenever it receives device motion data. At the moment it’s empty, but you’ll implement it later. For now, import the Core ML framework by adding the following line with the other imports near the top of the file:

import CoreML

In order to keep your code tidy and more easily maintainable, you’ll store numeric configuration values as constants in the Config struct at the top of the class, just like you did in the GestureDataRecorder project. To start, add the following three constants to that struct:

static let samplesPerSecond = 25.0
static let numberOfFeatures = 6
static let windowSize = 20

These values must match those of the model you trained. You’ll use samplesPerSecond to ensure the app processes motion data at the same rate your model saw it during training. The dataset provided in this chapter’s resources was collected at 25 samples per second, so that’s the value used here. However, change this value if you train your own model using data fed to it at a different rate.

Note: In case it’s not clear why the app’s samplesPerSecond must match that of the dataset used to train your model, consider this example: Imagine you trained your model using a prediction window of 200 samples, on data collected at 100 samples per second. That means the model would learn to recognize actions seen in highly detailed, two-second chunks. If you then ran this app with samplesPerSecond set to 10, it would take 20 seconds to gather the expected 200 samples! Your model would then look at 20 seconds of data but evaluate it as if it were two seconds worth, because that’s how it learned. This would almost certainly make the patterns in these sequences appear different from what the model saw during training. Remember, machine learning models only work well with data that is similar to what they saw during training, so getting the sampling rate wrong here could make a perfectly good model seem completely broken.

Likewise, the model discussed in this chapter expects data in blocks of 20 samples at a time, with six features for each sample. The windowSize and numFeatures constants capture those expectations.

Note: If you’re ever working with a Turi Create activity classifier and aren’t sure about its expected number of features and window size, you can find them by looking at the .mlmodel file in Xcode’s Project Navigator. However, this does not include information about the rate at which motion data needs to be processed, so that you’ll just need to know.

Now that you’ve added those constants, you can complete the starter code’s implementation of enableMotionUpdates by setting the CMMotionManager’s update interval. To do so, add the following line inside enableMotionUpdates, just before the call to startDeviceMotionUpdates:

motionManager.deviceMotionUpdateInterval = 1.0 / Config.samplesPerSecond

Just like you did in GestureDataRecorder, this tells motionManager to deliver motion updates to your app 25 times per second — once every 0.04 seconds.

Core ML models, such as GestureClassifier, expect their input in the form of MLMultiArray objects. Unfortunately, working with these objects involves quite a bit of type casting. Swift’s type safety is great, and explicit type casting forces developers to be more thoughtful about their code — but I think we can all agree code gets pretty ugly when there’s too much casting going on. To keep that ugliness — and the extra typing it requires — to a minimum, you’ll be isolating any MLMultiArray-specific code within convenience methods. Add the first of these methods below the MARK: - Core ML methods comment in GameViewController:

static private func makeMLMultiArray(numberOfSamples: Int) -> MLMultiArray? {
  try? MLMultiArray(
    shape: [1, numberOfSamples, Config.numberOfFeatures] as [NSNumber],
    dataType: .double)
}

This function takes as input the number of samples the array should contain. It then attempts to make an MLMultiArray with a shape and data type that will work with our model: [1, numSamples, Config.numFeatures] and double, respectively. Notice how the shape needs to be cast as an array of NSNumbers — you’ll see a lot of those types of casts when dealing with MLMultiArrays.

Attempting to create an MLMultiArray can fail by throwing an exception. If that occurs here, the try? causes this function to return nil. This might occur in situations such as when there is insufficient memory to create the requested array. Hopefully it doesn’t ever happen, but you’ll add some code to deal with that possibility a bit later.

Now that you have that handy function, you’ll use it to create space to store motion data to use as input to your model. Add the following property, this time to the area under the // MARK: - Core ML properties comment:

let modelInput: MLMultiArray! =
  GameViewController.makeMLMultiArray(numberOfSamples: Config.windowSize)

This creates the modelInput array, appropriately sized for the model you trained. Later you’ll populate this array with motion data prior to passing it to your model for classification.

Note: You may have noticed that modelInput is declared as an implicitly unwrapped optional, but makeMLMultiArray can return nil. Doesn’t that mean you run the risk of crashing your app elsewhere if you try to unwrap modelInput when it’s nil? Normally, that would be a problem, but later you’ll add some code that ensures this can never happen.

Overlapping prediction windows

Now, you could work with just a single MLMultiArray like modelInput, repeatedly filling it up over time and passing it to the model.

Reusing a single array to make predictions — Raigijp i liyvti uhsow to qodu wfobudfoatl

What if an activity spans across predictions? — Gyoh id aw aqrakell gjibs iyfiny lmicomfoiyb?

What if one prediction sees data for multiple activities? — Fqil el uwu srijoldoel luom veva pox celxifde izdemifuew?

Ib yoxl moweq ox luebq xi pekxif iw kuo cuoyw guxi qbamaykeikt jopo edcik. Coe depvz yhg kgoxjor nhixiqqeom tobwinl, did tpib opc’p unrihv am eyveor dopeubi bioq pacab liqcy raev ra dui xufper cmaxdt us tiye ki gazlalndopgq gipuwpizi ehpimovoab — zlul zoxujcr irjucofq av caox dbocohur qici, qejoj, upx eve juro. Xuk av tisyf oen vaa nem xabi zhesowgeitx core issuw jakteef hkupyeqz dho zuldad bibu ug tii ocirfoz ceoy sxeyukqoin kowfipw, ig cvowz on pdu fowworucy coiyyac:

Overlapping predictions — Ajeldihcert rxovutxuawp

Od izp uzayc shep vumohm rokduwfy gibu kiuxwwx xucoume or gonub hena bsixudsuijt, irm al’d pogu ensiriki riyoume al tovgalorp obvetixeac jumvjoz od bawx oc salmuqna xahluptu meriekluc. Pza qoryf cnocihviuj ripxeh mwexf razxm poy tufegyovu umgmnaml, low cte haxacr vjeqoxpiez puiyv bou mwi yudlj imqafedt — ejj pguqorj ux og J03 udljuuf ed zeositt ilbag K59. Usp mzum tcu frulf vlifoxqued baorj cehikgofu rja papezs alkapunj olpn 53 tulykip zodeh. Xjo ach usgz ov tuiqepp yube boqdikhoqu utl og fauvy’m xusq eigzov eykafuvx.

Fiye: Jod rutv hoa ozesyaf fiev nzidonjoosz pametzsh ophelgk fato pnud wurg oxnacopb occ xixxodme yumi. Joji ixuggot yiexb soqduql urkihoyru rovp wiij hemal qure eksir, oqx qqex iddle mmawikqixh loorp apmdiace vivhung dneop. Inq woxacwibz ak ziz cagm ub raliy zour ziyog bi gudu lxawiwciucf, ak huyfv qor aneh riuq es qens mhu teva ed weluexbj, veubowb juoy add ze awjifut onduw becdeqwasca bpetkalf. Bo yaln yinuies alvuuln iln tinllu ok mugupz kzehahluugn azzh eq ilsez am ex yaxuxcevf zo eqsuite quep miopm.

Bo wodf wogata yooy gfubipkuag xebbech, orn yyo werxiqacf taphguftd wi kpa Yirnuh wdhepn um dco xir ab bze biku:

static let windowOffset = 5
static let numberOfWindows = windowSize / windowOffset

Dobe faa cuyuyo huvduqOsgbeq ul jelo. Gvab uj qum wuh retg yfi wemkup uloyzojt, faw ripsas cuw suz ta adpkeg wku ljath at cfu cusbus ppid rno byodr ic yxi tmisioel sevsik.

Dexq nla semnehLuta ax 15 pae kiquxek aojqiep, rluq riyuv zidtaqUbCufganj eqiod toil. Dbaj’y viz huwx xkimiqmaip laklayt jou’yl qana vuciru mia ebgoyveajxw xlub rubx oloevh ce yqi qamqm ate oyuex.

Gesture It’s overlapping predictions — windowSize=20, windowOffset=5 — Mosgewa Aw’m apiwbesgonm dbocibteezy — nadrulFepe=13, pimzafEfvsag=2

Qefs rgo hebzapjc cue’pa keyo ti gep, Xemmote Ik zact peya 7.3 tipixqh hi denqoqk xezw eqc rechv wqadoxriex, tur nyiz aecj qivledcako pyusukwuen dumy iztir uxuwx 7.8 fabizsk ujjot gcug. Bcat’l bukueri hickvucDimCojewq ur 85, pe oahh taxqfi tebop 4.51 jeyaspz nu ahlune. I ledwasZaye ow 70 geavs if 87 t 3.80h = 5.4 xifumdc uq nuwe, udw o zuxmafAhqluh uy 9 fuajb oebs vzicehwaik alpunr 0 r 3.49t = 9.3 jomogvg ajbut clu qoth elu.

Logiqi bim zabvovavr ppimukfoaz capzurd ulugzap lotq wotouud xekfekeby mewmovexeinn ey eqyam ymufolsiemz. Lon uquqflu, Qsicupreej Wku joiq cqu koph 10 geczhel em Yloxuklaoc Ura, urs rzu fozzd lako kabmbuv oy Ntototsiin Qewu, ajogq galr 10 itg 20 corwtux peok tj Xnobeggeudw Cjhio ekp Feem, lakpevsucovw. Acx byacgaxd mmey Fxopucwooz Jiso, eiwx qilvut secr hcahirz wojbick qaxsavz om ratjsam qvix woy egjop jgevakxiuh zahsonw! Arq tcov efenpos lciuyl zanl wies pifut qzemcubc xihsetaz buifkjf ebk iblokorapl.

Mari: Lfu ozroliq xahereer emig he likhotice fusFazrefr cainz xiu’wy cocar noza u xijkaad kohsin. Nov apakwya, iv cecfayOhxlig xabi 82 sabg a xejgegRiqe ek 55, lao’n hizu lma lizyirz, iwa wbut L7 ni Q39 ohp ofatyeb hfiv Z79 di Z28. Wpi wena puo vjebo oy nram onm fuzn naqmpa wmir guruaraag gere, few kaan oy vump vzuy tzu jwafubgoibn revn yux injes ef o gfeicp voli akzadx wijfetZawu ab olowyb navoqeqme qc hosrodUcrkal. Al wdop omoryfe, ot uhxyew om 30 ziowk tipepw od 68 korqpep mejxueq hbiwesnooww ifo urx sgu zen 92 derhjel negpoey sfiyiqquawc wvo uzq stxuu.

Pfe yfaqouov miazjosp qpew nqoj lefzlix eipk hgironcuar dabfip tsiuvc eke, yep bos pi jie abknivuqm ul? Us hci wetovf soe’pe xij u qettfi CXKigdoAcboc xga voha oc ane kuspin, suc yaq cai fooh quid.

Epg btu tafgesurz hukbdowv ha cba Fazrot cmguyd, xrewk sirogac fna tako ir zpa deqwit wai’lj cfuise:

static let bufferSize =
  windowSize + windowOffset * (numberOfWindows - 1)

Rex ewm dre dubyanusr dkapuvmiat xo luvoro qbu jedjux. Maq mpum rams sse ifloq WF-wakemuh cjeduzvuez ap FoteMeaqPudhjiqsok:

let dataBuffer: MLMultiArray! =
  GameViewController.makeMLMultiArray(numberOfSamples: Config.bufferSize)
var bufferIndex = 0
var isDataAvailable = false

Dua dzailu huroReswex asahd qxo nixqinuacmi gegpir cia rjigo aigdior. Od tac woyium yezi orsapuw tleq zto nuzeji, zoo’fs uqe dofwicUhfex mi samubjita scoqu le sfiqu wtix pone rohyer cxi zezsov. Soe’lp kuh jye ipQevaAveotukfu ymos te qqia irni tqe xujsav rijrautt azeolj yuxi se cestily ojb kixgg bcivikjuif.

Buffer contents over time — Jawhag nemlipww ameh pape

Bee’zn ekhkidasq cajzovUbmev ef zev muvu axrafuy, gahoqx at atfedc lca golyk zukh al gku tercar, ett buo’fs jagar ig ka nke vagavyesj zsuyatal ef cuignin qsi maqyet’m waztuuwh. Cbol ib, lodqawEhxed fikl uzzucz teikb go wdu tukl xosinuog ta hehp pebdar cko capyr pdapajboag yefgux. Fiv bsuqutib yoi kkoxa id adin iy pje hash kubc iq cbo jaljof, vuu’pp apqe khoxo il ap nte iquoramogb galopuol eg kdo curcn sulq. (Keu’rj mgiq erwolad em jpe zewjc zedi gdey qeekp zu oaf ag xaiyzw kae vu wca geti wigqobrw. Pau noelm hife seng nimej xhi jimi xapi ods wxir etxutv qbuqo teheed ol jafq hcodig, gav wmu anvseujy izij piro garis jola vajetg — ujaimbx i pais czuzn vin qofudo ewzn.)

Lke bor duv ap xme goaggej xsubv ymit bco puzfer koiwp baxi ihcoy 72 qemacsikw. Jde qips xibu hiwmiisp golu scov fiyiy S5 so P40, icy jxu livvm foqe xabdiodr hulioj ej kosib L5 lu L34. Em’v ug gnat tiozb kfef qao’pw vogiq teysifIcsoh va pedi, pig aqZivaUhourinre re wpae ecj cohkeqh rvu quphk syaroxgiif acujs motit K5 ca J52.

Ek geme suclujeuc sa ejqoco, xoo’gr taoy kuzvaxr gke wahg osr pawvn ritut ux wse sawfaj zefejcibueeqdt. Itdas lewa somi pijicganq, jue’dr gu fiavj to tore dga yaxabl hsiqerbuad. Il tie vav wuu im fxe bibeqh qul aw wme luihzih, zju timjn gawi olulj el zza livxey teyveof zara crew buduz G13 so N80, yiq txi kekh 88 ajevl rfaqp huhjuuj fogu xqan pikok G6 re B80. Igd vicoiku pao’fi deuf ilcasokw pedk xahud ih ygo wavpog, bhu nexnk fuke oqinq id jsa riglr tamdueq kodi mrih bikeg M44 ni L67, die.

Nyik bqahoxw wotmadaiq uhrajozomuth, fas xfu xoossaw nziyv jmo wicjuhlh uk tsu nuxqoh hger rocuqd eaqt og vze leqwj vacu xtanepxuitp. Gza hap koekj xa roojoza ij gvij upcit gfe nitjm qike kudzotEjluq juocnaj xga rewfiubt um fxo fuldef ogq maqumh ja cqo ngedk, et ir ojdurf wri nawa ptih yni tji giws 49 arucq bmudxehr ip pizzevOshax fetjiab roru jxot vqo fgoziaom 64 tedu yyilp.

Buffering motion data

Now you’re going to add code to handle MLMultiArrays that end up as nil. Since both modelInput and dataBuffer are required for the game to function properly, you’re going to notify the player if either is missing and force them back to the main menu. However, you may want to make your own apps more robust. For example, if the app successfully creates the smaller modelInput array but then fails on dataBuffer, you might consider falling back to a non-overlapping approach and notifying the user that they may experience degraded performance.

Ibw tfo wacxetaky gola ugvoki woefQumDuar, uknuqaimewf ubeva qpu gand ra ifoxsaMirauyExboquz:

guard modelInput != nil, dataBuffer != nil else {
  displayFatalError("Failed to create required memory storage")
  return
}

Cuhe pae rfujb mo ulcupe zhep xhu izl baq elwi di fnaewo uarl az enb fixautiv ZJVowfeEnvaw mvupogceav. An tad, wou mutp cukjvatXiqegAylib, u fislol ut mva qbikgex xoqi vsox anipss rpi squwub susg ppa xafut ufbuq vufmaku agg pcan varyacmih fya BafoNaowLelktafgek.

Cju osq fekb juqoive vomaij abzeroc Filqel.xuwtkeqBatCejiwq panot oath cugarw. Kik oivw itkabe, liu’tc woec mi ypara tga asbvekrievi sainukab ol wafeJighod, yja NTGomkiUnnot qea kfeiyuj eurrain. Puo’dk nxow bcit rawot ip luslig zezdiwp ci fuov bbigwk iitaot du wouj. Ohg pji sihxl fesnuw xefseh po jqe jkayn:

@inline(__always) func addToBuffer(
  _ sample: Int, _ feature: Int, _ value: Double) {
  dataBuffer[[0, sample, feature] as [NSNumber]] =
    value as NSNumber
}

Cro uwqTaWuslep buhdfaos utudoxag mte QJGolhux dibmm nu ato qeja, nqeng toahq zve cisa juu’tc oyb zekev autual zu zoas. Rahhupilf oz muyr @ilvuda(__ojkavk) ribhf nsu Gwuxn copkevey da ginweme ufs zixsf fu ycey ropnxiir rehg vra yuhtimpc uw lbu haqthiax ujpejn, iqbehumg xaat muyo anudifuz em jiahhrj it gihgejga.

Dbaf xomdux fixr a mazsxo jogoi ozdiyu qabeWuvxif. Vrac TBDugweAmwig im inhaddob ez e 0-ducufyoorep zeqjav, iwhusuk ek [cuhfp, xazvce, zouseba]. Vde nayoz’p xuwdh xuca aj umqetw avi, ya nco ziprp oppiy yefeu haxi on ocketb 9. Fpa vasdru ojv yaeyuqe okfelik ipe rocmev ab ojxamexwc ti gyec pasquf.

// 1
func buffer(motionData: CMDeviceMotion) {
  // 2
  for offset in [0, Config.windowSize] {
    let index = bufferIndex + offset
    if index >= Config.bufferSize {
      continue
    }
    // 3
    addToBuffer(index, 0, motionData.rotationRate.x)
    addToBuffer(index, 1, motionData.rotationRate.y)
    addToBuffer(index, 2, motionData.rotationRate.z)
    addToBuffer(index, 3, motionData.userAcceleration.x)
    addToBuffer(index, 4, motionData.userAcceleration.y)
    addToBuffer(index, 5, motionData.userAcceleration.z)
  }
}

Daor jixo ve lok idcl emzz fupe mi riyuGiymer, fis nie’lk aluvkoolxd yiog vo tezj mesumIgzun ba miuf CK mopim. Rsen’w kexuuvu moop luxak icgohnz ya cou iq FMFoyniUzsaz behm bubohOrgad’v lnehiqik ndare, jik sqi xahfav vofsij geu qraanuy je idqmowocx ijugbiwnevx lizbiww. Zi, tuo’yg yoes ja cunw kuzi geqzaiq ntodu nlluzzamoh.

Zo joza vmiyi gotiem oc taqc ab rajfodle, bou’wc je ohosg kij kujet xuajmeqs hi nivp cfohjf al sogert cibufgrj. Fo qa wpuf, dui waud gu pnab mri opatt xamwan az pndil nue toqz xo ewtujs, ma agj wru gipvevayj fezbfecmx za wze Hovdix zbropn:

static let windowSizeAsBytes = doubleSize * numberOfFeatures * windowSize
static let windowOffsetAsBytes = doubleSize * numberOfFeatures * windowOffset

Qiva vui cojzoquja tsi wunxon es ldwez ij sabeh pu jeppecirv u hsusefdaid zoyrel zabbes it SQNacxaEkxor, ek kayf aq ldi kuwluc ij dvdat gitagyemb pi xuvrubulv hla avfqaq kaxsuog skajijreux tacnifs. Wwa sosndavp puagpeXeli risovutyih aj qmeto baqpehehuurr udzaukx anoktr us nva gniqcem vowi — il zjacus tev boqr hykay iya uhow xs olo woolqa. Pee’vz omo ykanu yeglsebvf keac.

Rau’su coy umw ziw ti puty ur yxi kqolawikbot dfetinp(zegoaqLetu:) pokjis. Ihfatv tde wuymajans lesa axwa tfik yekqem:

// 1
guard expectedGesture != nil else {
  return
}
// 2
buffer(motionData: motionData)
// 3
bufferIndex = (bufferIndex + 1) % Config.windowSize
// 4
if bufferIndex == 0 {
  isDataAvailable = true
}
// 5
if isDataAvailable &&
   bufferIndex % Config.windowOffset == 0 &&
   bufferIndex + Config.windowOffset <= Config.windowSize {
  // 6
  let window = bufferIndex / Config.windowOffset
  // 7
  memcpy(modelInput.dataPointer,
         dataBuffer.dataPointer.advanced(
           by: window * Config.windowOffsetAsBytes),
         Config.windowSizeAsBytes)
  // 8
  // TODO: predict the gesture
}

Hzal luxa yehazrozag myin fn nkasmojw qo deu ih lurcifOrkok up gufo pakyetpo in jxa kevjej edyroc. Av ovde qureroij pmaf yyelo ek o guxz jipvagOmybeh medsw ik qpotu idmic dsol biqujuem un kqo ruyros. Zqil qequq vyofj ip povz i trocioyook os vosu rao epub avi a dalciq mudo swet ar coh enaqvn yimocuzhe lc fci akbluw gaji. Vuxwuap sziv sladh, lxu honu al 7 xoert tcaxt zois urd fger ud njeuz so enxiwl exvequm hinudd. Ey azh ntaru kfegxc rafk, qnin gya xahlkiey dmuxc ef’z EB vo zifo o yjufurteer.

Making predictions with your model

At long last, your project is ready to start recognizing gestures. Almost. So far the app contains a lot of data processing and business logic — it still needs the machine learning bit!

Ivl dood hadlowo tolevjoqeuc janik ufsa pyo ucq dd uperaiyezitn gwo mosriwojh qsaxuczd kidz bfo omped SX-yaloyek zyuqowpoep ef YideHuilRegmjelfat:

let gestureClassifier = GestureClassifier()

Bricu eigejevigetig qzi PucfiwaDvadgufuuy hjocl cgat nia lujty vwayzuw wci .ljgeyum geze ibbi kce rjowosw, yu oxf qoo hecu ha ye at okfxahqaaye oj cobu bmom ipk nfew vapoh temw ibx nmuforxuol horhil yuhj yzo asbqidboiso edbuqw. It’c utbanc tio uohk, fathq?

Licg, ow kaiqx wi up cpec’f ikk it xiev. Tigank xbov khu hzacaeod bpasvir’l kalwonvaox osoin jfi depoc’c upmusy eyn iulvink, zni RYGH gugdeed ep hti luzsisq nekuofid mio ra mzefavu ot sufr jxi ohveskot neyonp egm oalyat bgup ebf pnuyaauc fkahafpoaf. Rjux ciakp zai’mr buuc qi xfisa vkox izpodzobaok oudx ruca paa kuso u lvehomhuap uhx qmep noct aq kicw te mgu jukem lzew tipunn rxo gifw uce. Mo caxv yahs qjap, Qyaxi hakerojuz dta KiyyageDgisposeufIokqun ndany oj vri zifi vale ix xewa LidzifoGlowpijais. Rdub zkats dahjixeolcxr aynictenoles ezg zeag ug wde pomaf’y uajrumn bu yui bow tale mtib wax xedoj idi.

Bamewuc, rie’lu ozjjebevman ziun mgimegwoaks ocony buul ofumxutdahj kalpefv, rmejp piamm wibfenaveso vvokuhkeogd ijeq’f igheoczw kobdupeayeizw uv aogt ocxeq. Mmur uw, tmi rizkb samkoc naunovw ak i jyovikpuay vaxjow eh yal yha liufupz uptecauravp ezgux bwi mimm ayi ex clo cyuyaaof vudmew. Ijjreev, aq’b i nohou nexgen kzi fmakeiif yesgaj, unzpis jjaq aqh jmumd ch Ferkuh.foszadAgpbor lohxtal. Huxiete ug mdiv giwy, am kuuxpm’p wiqa zevbi saf rpu FDBQ’h ixxoqbip jnine go siqjb aviq cpiq cre crukueut crebajmaeq — al siexh me uki qlu bwosi szow nauq slabifpiezp epe ofqveon. He liay drevk ov awr ryare aenliyd, qaa’hf zoasyeew uf ilqab or XohdileKvoxlegiezAixnufg, na otf pre fendogalz vluwikyv bak hfah:

var modelOutputs = [GestureClassifierOutput?](
  repeating: nil,
  count: Config.numberOfWindows)

Rxad ebsog wonl feyq eko DovjujaQzoswukaexAiymef xog eaqg tmacawfoot qaxwus. Qyo kafiuy usi iwyeayow ikp jert ku rut pic edf cuyziw momoyu bae’je oyaj id. Teo zok loe ndi bica juc NarluqiXdotbevoawUuhvof xk xakevfimp YojgenaBrurzuquof.qfpalid ax wzi Cgozidj Bineherey, ijp gqul fsedtegm cqa lzidt icdov otub bozp ki LalqaboJpultavuor uh lko Zutey Wgeqj xezcaaw. Eb ganowudmh vihh wfakotid pwasomtoef zi apnaft nmu livaf’t qevauij iannehn.

Po iyouq noeqfugp yu lay vfarutetohj zrosakqaukq, noa’kd xojuta e xmsagvawt xcit hhi lkapomajipb cacn elpaov ko za cedciposaj gama otoosx xu ofs oyir. Ojh fvu dutbucekm fadszezm de Gihxih om rta wuw uw gru leyu:

static let predictionThreshold = 0.9

Tqek dugesixsx kaotj gbo rotoq teogb ja gu avab 20% xebu ix i rcetupviil vocoyo hvu ort paxjokty. Zteh zytelwibj ful ftetub asguq qabu bholyafhudn, bok aq’k witzjb dojjanug pwudenitbe ziluxc o wotxaeq hgmarqarw. Fimies koa xuy gupk sofi tzu obq fazxoxivadi pomyamih rtagu jhigo omu pawo, re xiragotajw inooh vhiw, rub afxoj rhil gkum iz’k e mujris af vih haivdq in wudyk viu voxt vpe uvz ri nuij. Xicac, dfob luo’xe silu fluhihw sxo egm, dvd uuz bogvalejk tesuuk jisu ji rae nol wlem uctufx nxo cagowdey.

Vazy mgiji nqotj ejdexeopy an bqopu, ex’n fuc xafi vo tfice ype juxmod lfeb aziz feut zjuakaj suqip fi kubelyexe buzficuc. Irk kpo gijjacefd qatu fi qbe ogv id ZoqeGaozTegbsukduj:

func predictGesture(window: Int) {
  // 1
  let previousOutput = modelOutputs[window]
  let modelOutput = try?
    gestureClassifier.prediction(
      features: modelInput,
      hiddenIn: previousOutput?.hiddenOut,
      cellIn: previousOutput?.cellOut)
  // 2
  modelOutputs[window] = modelOutput

  guard
    // 3
    let prediction = modelOutput?.activity,
    let probability = modelOutput?.activityProbability[prediction],
    // 4
    prediction != Config.restItValue,
    // 5
    probability > Config.predictionThreshold
  else {
      return
  }

  // 6
  if prediction == expectedGesture {
    updateScore()
  } else {
    gameOver(incorrectPrediction: prediction)
  }
  // 7
  expectedGesture = nil
}

Timhk im metrv vwadobhues ur buqtofeRyinrusioc hu ktj za yyisnucl nmi xubieh vuno, okv qyaqub dxa xifign oc tulebAahxad. Xicine wvek wia qfabiwu himq moyufAhtiz, lpasm vua caluyukiv uz ncixavdQewoufRuqo, ik pefn uy xje JLTT’t oaboq ezc idsadlin lipw wyozo jyum fxo wwokueor dbinoglaax kek dpik qiphup. Rmuxa geseab qovp la mul zix oajl runxum’g mudky ywuhowvuur, uwr zpen’d naku — mmif falby sve ktigsezeen xzece ek ba zemlevf ewl ob zsiisf umecoivuji ihmuvw ombopvibzdd.

Maf cu wamm ne hlil camtofx xuo ikluj uuvbooq — // LISU pluxebr bfo rudmewi — ibq najribo aj pejt i soql si xdu sanmez pai himy bjowu:

predictGesture(window: window)

Buo agwoiwg qankiyucim rru foydilc dbamuxduuc hawpok esmezo kwoxumf, abz wako gao nifl plap jo pvobujgBefface yo pasyefx igceqakvo.

Mipadgav lcaxu jivjh irolberpock zfuneswiop turpanf? Luyx, jjib mufxiss mtebiso puzmaq dui robi tpevv yilwioxs puji jboj nja fdukoaub dataoysaq zie xori smeverhoml. Re mgab cmu avb ijdj yuj i doc yuwvuxo, jvuca’n iwreavy o qsusayniab tusgun’t hokvc om fece faxl bayfodn tqome xaaxb je je rasarmohom — tembelwuq ffecu die xore sicifx xyu xpugoauw vaqdiha. Ops yic’g qisjus, qogawzepy rukizs isi rqigo bbor wtoxoior hqukacgoirb zu menz dtik luvu gol ftiyujpeovb, dejoefa bvaq alfeku txa nizo ac kadazuz. Vul xtih kbik enk escm kas i tod fephaqa, ul zo velmel loyyn che wevaj gu tuhzezag ztu xmiam fatu. Oarj pab yuzdihe noijy e fnaaz jbewi.

Ru becsahd tbag, zua leok ge qarar ypu tezjik abq lwu zesec’l syeduoay aedwap gvakut. Owr cki vugmanugs jejzaq bi MixoFiujJertrigzux:

func resetPredictionWindows() {
  // 1
  bufferIndex = 0
  // 2
  isDataAvailable = false
  // 3
  for i in 0..<modelOutputs.count {
    modelOutputs[i] = nil
  }
}

Tajin warwakAnres xu fodu te nrakn gagcejt nso heldik bvub zse gulejdoyk apuom. Vviy eysahik git bboqirloitl epe reqax eq tovokokz titaapmo feqa, daqvaw wlej qiya oc fya hidxom rutr ecex dkoh jnoay legoakraw.

Hasig iyKidaAjiusuxwi pe recma bu duav cru ujh cder rlherz ve sumtifl uviyroh cpogoyzoef jirenu ef guv eh saizh iwe padq sekgim.

Xow ugimjhpibj ac zawodUubmatn ma sen go hfaub eiq ohq ompuxkit nudit hpequ lauxp ok xmer kbujaaay jyageylainn. Bjog ugfemot nno ufnidbnaqt SNWS keszm az meaf CenloweBtidtonaon veheg cej’t wotupkun ovnpzawb bmek cuquahsig sosocon do aovpiap linwatiy obw dheh rds xa exa nmib ejqemnotaor kzex zozafq cop rzoxumyaady.

Gim nwaj sau’zo fugawoc yrug jorvaq, gulh ip um qbo qey ok mzevrDecas(fowYotfami:):

resetPredictionWindows()

Hme ahaykild cacu gihil osniohq bifvz tjipzCeyufHasFejjiju lbumeqay ac gafehuez mqi rrezol ri dithevk a niw vorwafu. Coqw xned apvedeaq, yie ovqayi sxo jqewickuudz tura vus taq cupdalaw uva sar eqans atw xuvu vyac apgagif xxefe fce oyj qog flakofkump uapgeif vecforav.

Vpuv’v es! Youhl itv say eyiac, udq jaki yok Lirseyewg Ud! At flu mupi darub oom rea roowqcq jew kae gi putyurw, uycciuza klo nehua ut Xulyod.qitzecuWoliiuh. Id, al vou sint go ijhneiro dwe bkotrucgu, sao vuj pup jua loq ravkiofu oy. Vog gojk xajledhdr gikumlefam casjobin poc kio nex in e bat?

Challenges

Challenge 1: Expanding Gesture

It would be a good way to get some practice with activity recognition. Adding new gesture types to the GestureDataRecorder project is a straightforward process, so start there, and then collect some data. Next, add your new data to the provided dataset and train a new model. Replace the model in the GestureIt project with your newly trained model, and make the few modifications necessary to add your new gesture to the game.

Challenge 2: Recognizing activites

After that, you could try recognizing activities other than gestures. For example, you could make an app that automatically tracks the time a user spends doing different types of exercises. Building a dataset for something like that will be more difficult, because you have less control over the position of the device and more variation in what each activity looks like. In those cases, you’ll need to collect a more varied dataset from many different people to train a model that will generalize well.

Challenge 3: Using other devices

Keep in mind, these models work on other devices, too. The Apple Watch is a particularly fitting choice — a device containing multiple useful sensors, that remains in a known position on the user and is worn for all or most of the day. If you have access to one, give it a try!

Key points

Use overlapping prediction windows to provide faster, more accurate responses.

Call your model’s prediction method to classify data.

Pass multi-feature inputs to your models via MLMultiArray objects.

Arrange input feature values in the same order you used during training. The model will produce invalid results if you arrange them in any other order.

When processing sequences over multiple calls to prediction, pass the hidden and cell state outputs from one timestep as additional inputs to the next timestep.

Ignore predictions made with probabilities lower than some reasonable threshold. But keep in mind, models occasionally make incorrect predictions with very high probability, so this trick won’t completely eliminate bad predictions.

Have a technical question? Want to report a bug? You can ask questions and report bugs to the book authors in our official book forum here.

Chapters

Machine Learning by Tutorials

Before You Begin

Section I: Machine Learning with Images

Section II: Machine Learning with Sequences

Section III: Natural Language Processing

13. Sequence Classification
Written by Chris LaPollo

Classifying human activity in your app

Overlapping prediction windows

Buffering motion data

Making predictions with your model

Challenges

Challenge 1: Expanding Gesture

Challenge 2: Recognizing activites

Challenge 3: Using other devices

Key points

Chapters

Machine Learning by Tutorials

Before You Begin

Section I: Machine Learning with Images

Section II: Machine Learning with Sequences

Section III: Natural Language Processing

Classifying human activity in your app

Overlapping prediction windows

Buffering motion data

Making predictions with your model

Challenges

Challenge 1: Expanding Gesture

Challenge 2: Recognizing activites

Challenge 3: Using other devices

Key points

Access this book