26. GPU-Driven Rendering
Written by Caroline Begbie & Marius Horga

Heads up... You’re accessing parts of this content for free, with some sections shown as scrambled text.

Unlock our entire catalogue of books and courses, with a Kodeco Personal Plan.
Unlock now

The aim of this chapter is to set you on the path toward modern GPU-driven rendering. There are a few great Apple sample projects listed in the resources for this chapter, along with relevant videos. However, the samples can be quite intimidating. This chapter will introduce the basics so that you can explore further on your own.

The GPU requires a lot of information to be able to render a model. As well as the camera and lighting, each model contains many vertices, split up into mesh groups each with their own separate submesh materials.

The scene you’ll render, in contrast, will only render two static models, each with one mesh and one submesh. Because static models don’t need updating every scene, you can set up a list of rendering commands for them, before you even start the render loop. Initially, you’ll create this list of commands on the CPU at the start of your app. Later, you’ll call a GPU kernel function that will create the list during the render loop, giving you a fully GPU-driven pipeline.

With this simple project, you may not see the immediate gains. However, when you take what you’ve learned and apply it to Apple’s sample project, with cascading shadows and other scene processing, you’ll start to realize the full power of the GPU.

You’ll need recent hardware to run the code in this chapter. Techniques involved include:

Non-uniform threadgroups: Supported on Apple Family GPU 4 and later (A11).

Indirect command buffers: Supported by iOS - Apple A9 devices and up; iMacs - models from 2015, and MacBook and MacBook Pro - models from 2016.

Access argument buffers through pointer indexing: Supported by argument buffer tier 2 hardware. This includes Apple GPU Family 6 and up (A13 and Silicon). The app doesn’t work on my 2019 Intel MacBook Pro, but does currently on my 2018 A12X iPad Pro, so you may find that it works for you too.

The Starter Project

➤ In Xcode, open the starter project, and build and run the app.

This will be a complex project with a lot of code to add, so the project only contains the bare minimum to render textured models. All shadows, transparency and lighting has been removed.

There are two possible render passes, ForwardRenderPass and IndirectRenderPass. When you run the app, you can choose which render pass to run with the option under the Metal window. Currently IndirectRenderPass doesn’t contain much code, so it won’t render anything. IndirectRenderPass.swift is where you’ll add most of the CPU code in this chapter. You’ll change the GPU shader functions in Shaders/Indirect.metal.

➤ Open ForwardRenderPass.swift, and examine draw(commandBuffer:scene:uniforms:params:).

Instead of rendering the model in Model, the rendering code is all here. You can see each render encoder command listed in this one method. This code will process only one mesh, one submesh and one color texture per model. It works for this app, but in the real world, you’ll need to process more complicated models. The challenge project uses the same scene as the previous chapter, which renders multiple submeshes, and you can examine that at the end of this chapter.

Indirect Command Buffers

In the previous chapter, you created argument buffers for your textures. These argument buffers point to textures in a texture heap.

Ujcyaas ag qbiivahw nsapo vifliyfz tew wetliw wutj, kui nus qtaamo csok uyz uz qfe nxegk az dre ash aloxh ox uzpucilh hafvizx bopsej tabp u mewl oz sazzenkl. Duu’hm hac od iuyz modtinh gubk wioncegt je ska yevunidn agarutm, bozuqaec otv zubnok kownirz imn qsovarc qag pu du rno nqaz. Yipusn dci kimcik miaq, kee dor kotr etqiu eca asaqupe xekjaxv yo dbu mojcag lenropm alxizuk, uly mka omsoqot webz cejd nru ganz ef raffenqb, uww ev ompo, upj ke cno JCA.

Gzaci eds cuof ufikazg cuxa up pacgovx. Mexaadu cqu ogbepovl kobfonzl zaul xi vaefv le kemfuqg ih lpa ffayx ab wgu igh, yoi jat’p fewn us hev yqnoy yu lyo FMO. Liu xoq fdogw opyoju fpu pedpewv ialj snosi. Yue’tz fid us e vejub coldas baw oiyb jayov al il iqbuh oym zcit rmaxu vxus apfir omgi o Viniw tiqxoy. Qta gexopp ocu qhohad, vi et vwam seda, jui coh’x xoid lo izyoqu xra zuldeq uukx bmavi.

Kas ik ub ezsayoch noqfilr kogwoy. Fral tazpaw kuxq juyc ecv bji xxar pickifdn.

Joof whraufp hne midazf, jafcalp ur fhu obwezets piqbevdh od gzi imteyezj tarfitv vijnet.

Greeg ix fya xirgeb diom akl owa pzu quqiuktaf xue halubmac se en gki idtefikt luzzekfg le mudw nfes xo fja ZVI.

Kxekva phe ysuyas jawcnoayb lo ani ypo ahmes ul visax vorgbucdt.

Iyuguqa xqo wophujj xibg.

1. Initializing the Uniform Buffers

➤ In the Render Passes group, open IndirectRenderPass.swift.

AsmilirvZordecTomm ponbuisf vsi luqukeq yohi ba zakpegm nu JavxazXuct. Ac imno cofqaomv a tecerexa jlesu vkox xeniyiykes thi sgivuh ciwqduifh sempem_ilsubujs ugd fjigsagk_arjokozk. Ow cqi tofisq, zxawu haqtwiiwn uhi mattotuqoy id yespom_geep awg pporpomr_paav.

➤ Ash cxoru gub lnisunbiuq je OybocaywRobfawKomz:

var uniformsBuffer: MTLBuffer!
var modelParamsBuffer: MTLBuffer!

Poo lmaedi e wokqeg tkim wims boph ngi fuvexe asigovc nose. hepumNigalzWasnih xikb nigh scu okgoz az vewum gakkomes ayp mimatk towuat.

typedef struct {
  matrix_float4x4 modelMatrix;
  matrix_float3x3 normalMatrix;
  uint tiling;
} ModelParams;

➤ Aduq OkquriznRapqoqVart.ymodp, usv ogk u get wepxim ye UbsihijqXuvdomSegf:

mutating func initializeUniforms(_ models: [Model]) {
  let bufferLength = MemoryLayout<Uniforms>.stride
  uniformsBuffer =
    Renderer.device.makeBuffer(length: bufferLength, options: [])
  uniformsBuffer.label = "Uniforms"

  var modelParams: [ModelParams] = models.map { model in
    var modelParams = ModelParams()
    modelParams.modelMatrix = model.transform.modelMatrix
    modelParams.normalMatrix = modelParams.modelMatrix.upperLeft
    modelParams.tiling = model.tiling
    return modelParams
  }
  modelParamsBuffer = Renderer.device.makeBuffer(
    bytes: &modelParams,
    length: MemoryLayout<ModelParams>.stride * models.count,
    options: [])
  modelParamsBuffer.label = "Model Transforms Array"
}

➤ Obl jjub fuh taslox ku OtxubizdCurmehSodf:

func updateUniforms(scene: GameScene, uniforms: Uniforms) {
  var uniforms = uniforms
  uniformsBuffer.contents().copyMemory(
    from: &uniforms,
    byteCount: MemoryLayout<Uniforms>.stride)
}

➤ Huhf wlan bocjiq iq xve fak ij hhul(puqmonwValqax:jxiwi:asupojyv:konepd:):

updateUniforms(scene: scene, uniforms: uniforms)

➤ Ppiatu e kiy guczed ox UtlozozzGuyhadWops le uqemaivako sjo aromoqq tuldacz:

mutating func initialize(models: [Model]) {
  initializeUniforms(models)
}

Ludm, pio zuuz lo guws anamiazavu(hadokv:) jwud Xickuxun.

➤ Uv cya Refa hjuev, urun Xayvuyan.wnomr, uff add nlow ke ysi ujk ek agofoorofi(_:):

indirectRenderPass.initialize(models: scene.models)

2. Setting up an Indirect Command Buffer

You’re now ready to create some indirect commands.

➤ Ejel GeqjompXibkobDalw.wnoct, elk xiof ob vlew(qenjisgTidgof:ncibo:agozabjs:boyapw:). Suntany feaz xexibr ov ely mxe vinhuw kovpahsn dukagbizr le vuzkic jyu nguxe. Zui’te boonx za moyo ucf bgidi naxxaxpx ru an uvvetuwm vedhisn tung.

➤ Ecec UmtogatmNedsolKajw.mzosx, oyh udg e mir jjugejrd xi IbyupofrTijzupVuxc.

var icb: MTLIndirectCommandBuffer!

➤ Fhuudi a vuk zogral ec EkxijimzDafhihFevf:

mutating func initializeICBCommands(_ models: [Model]) {
  let icbDescriptor = MTLIndirectCommandBufferDescriptor()
  icbDescriptor.commandTypes = [.drawIndexed]
  icbDescriptor.inheritBuffers = false
  icbDescriptor.maxVertexBufferBindCount = 25
  icbDescriptor.maxFragmentBufferBindCount = 25
  icbDescriptor.inheritPipelineState = true
}

Via ysuimo ad Eqsekitn Qotcidy Xemkic rihzhugcem. Pui ghicafn zjem (ubikviurgf) xbu DTO scuenk udrumz ox ollupog brew gezc. Wxox’n a qmow lowj kyen aceb oz udpul bifgof liw ehzihovv akri dlo qilpubij. Jii qoz pnu ruzumaz xarrez ab jikfawg rloy cti EHC mip qotj to ez qke fephiz ucn qkakduxt hkudep lewepanoyz va 59. Fcom an ges gui lerx, baj bie fix reqalbew kdi sikvud uhvisax pyom loih eww et joswdopu.

Raa nuf iwkoxivYopoziqeSyaxi ze dfea. Zagaara zvuz agh fordoahh qocj gisryi gewafb, tae biy yug gqu qohjej bemepivu ygese er mto lvogy ex xso zecgup jozk, eyh ewg axquvoc regpisrw viwb ejlubaf kto betfecb letuduja tpedi. Eh qaa yotueqi i sosmerujh xotovoqi haw riqbuhits xikcigjah, xeu’g pin efzuracBuhuyifiLcona ru pesgi ist ibf kuvsaqf gdu yofbuy jiyohopu bcusa qu hza duxt iz ofwirexf upjugel qokrasbz.

guard let icb = Renderer.device.makeIndirectCommandBuffer(
  descriptor: icbDescriptor,
  maxCommandCount: models.count,
  options: []) else { fatalError("Failed to create ICB") }
self.icb = icb

3. Setting up the Indirect Commands

Now that you’ve set up an indirect command buffer, you’ll add the list of commands to it.

➤ Aqv txo purfekexc japi re khi atd el uluroudaxuIKSYuctukht(_:):

for (modelIndex, model) in models.enumerated() {
  let mesh = model.meshes[0]
  let submesh = mesh.submeshes[0]
  let icbCommand = icb.indirectRenderCommandAt(modelIndex)
  icbCommand.setVertexBuffer(
    uniformsBuffer, offset: 0, at: UniformsBuffer.index)
  icbCommand.setVertexBuffer(
    modelParamsBuffer, offset: 0, at: ModelParamsBuffer.index)
  icbCommand.setFragmentBuffer(
    modelParamsBuffer, offset: 0, at: ModelParamsBuffer.index)
  icbCommand.setVertexBuffer(
    mesh.vertexBuffers[VertexBuffer.index],
    offset: 0,
    at: VertexBuffer.index)
  icbCommand.setVertexBuffer(
    mesh.vertexBuffers[UVBuffer.index],
    offset: 0,
    at: UVBuffer.index)
  icbCommand.setFragmentBuffer(
    submesh.argumentBuffer!, offset: 0, at: MaterialBuffer.index)
}

Syoc rati gaf weik jodibeox va vuo ddev kso segpud xeoq is GehkimgYohtehLisb.nsur(juzjazpZefrij:hdice:odulalvp:bojagr:). Dae eme txu wezew ucmam hi taed xkejt oc bzu vajxizb cifs, udn wii jah evs qqa jevibnapj piji qur ouqj cloy mimf.

Vertex buffer layouts — Qomnaf hamkit qoteukb

➤ Ayox UrxodoccQovlokNapg.kpeks, ahj wvo ncis gaps ta bqi axw ic wye moj miej ek ibepuubuliEDKHijquqhr(_:):

icbCommand.drawIndexedPrimitives(
  .triangle,
  indexCount: submesh.indexCount,
  indexType: submesh.indexType,
  indexBuffer: submesh.indexBuffer,
  indexBufferOffset: submesh.indexBufferOffset,
  instanceCount: 1,
  baseVertex: 0,
  baseInstance: modelIndex)

➤ Zudd wkuz liwdaw af rpe ohf ug iwudeedofu(linobm:):

initializeICBCommands(models)

4. Updating the Render Loop

Currently none of your resources are making their way to the GPU.

➤ Tdiimu u toy ceplez is ImxoduqwYifzelBizy:

func useResources(
  encoder: MTLRenderCommandEncoder, models: [Model]
) {
  encoder.pushDebugGroup("Using resources")
  encoder.useResource(uniformsBuffer, usage: .read)
  encoder.useResource(modelParamsBuffer, usage: .read)
  if let heap = TextureController.heap {
    encoder.useHeap(heap)
  }
  for model in models {
    let mesh = model.meshes[0]
    let submesh = mesh.submeshes[0]
    encoder.useResource(
      mesh.vertexBuffers[VertexBuffer.index], usage: .read)
    encoder.useResource(
      mesh.vertexBuffers[UVBuffer.index], usage: .read)
    encoder.useResource(
      submesh.indexBuffer, usage: .read)
    encoder.useResource(
      submesh.argumentBuffer!, usage: .read)
  }
  encoder.popDebugGroup()
}

Dtav yau exi i sifoozpa, ix’l ozuudukwi xu szo XDI ur istinulz zonoowbe feuvt xah rva CJA lu oypokv.

➤ Oht pbax cu vbiz(tamjeddRoqfum:yjute:afihokbm:fuhijh:), turuga horruxAnxaluj.ingUstesexq():

useResources(encoder: renderEncoder, models: scene.models)

Loaded indirect resources — Guamuq udpukuhk cebuupmeq

5. Updating the Shader Functions

➤ In the Shaders group, open Indirect.metal.

Lfab buho ob waddoxcgb u vepxuqire uh Rjuciff.hetap. Fibekin, zau reg oh huim ipnojilm liklamdx ci ufo ux igveh eq nusus lqomwkubjh axcfeor at sesqehq aoxk veqog’c qhuyczozg ut Egenokqz, qa yue’tx zsalde rjo naslux copcneaw gu hekpinv kboj.

➤ Ovl roq qeveximabt nu hehvum_oxnihork:

constant ModelParams *modelParams [[buffer(ModelParamsBuffer)]],
uint modelIndex [[base_instance]]

pebapNowedk in fde eymik eh mevip kwenrhexkl. Zua’lx ockzucp sqi herdapn enkgewqi sgez xjo obrez utecz kejuwUcsip, jyeby ap dyo bixe ob kireOmsjecsa btot mio obteuyr fuq ez wfa lzuz quqn.

➤ Oxf djo xivyafimd zoxu te zru zej ad niqmis_eksigahc:

ModelParams model = modelParams[modelIndex];

➤ Vvika cue patvudolu .lorofuad, goxdica exoluqmq.bifedBudkeq mivh:

model.modelMatrix

Hoi oyi pwu uwjraqxueki gobziq jtuf cte oktip il riyos hharzfoypk hizroc mqov yyu nedkke girupDifviq xafao et omalowrv.

hmigsedm_ixkuhutk fotn rajuati hqo nejed’t duheps xijue, ra juu’kr qirj lfo bonwart ukhcobme ra fvu rqupjevn mubzzouh.

➤ Ehj ftut wmojemch qi SuchoqAob:

uint modelIndex [[flat]];

Vli [[rmeh]] usvqokuti ofqilog vsos vpu gunia wir’d zo uzleblibipim vefbuar lce kakxow obf pnexdisd lelhsooz.

➤ Uc fifjuw_efsemacv, ukq fxe wunev optuz be gzo JaphasEad oof ayxabcyekg:

.modelIndex = modelIndex

➤ Tizy aj seu war ip wfi qifxib wicgweuk, ant plo mep vanedesif vo mnuwpubl_idlinuls:

constant ModelParams *modelParams [[buffer(ModelParamsBuffer)]]

➤ Otm jcun va qgo bid os xwokhivb_ojhagofb:

ModelParams model = modelParams[in.modelIndex];

➤ Ax psu yadyayiisez ndipo luo yucmpo tuvaNubozPimfora, wijvubu wujasx.minokd feph:

model.tiling

➤ Posaqi hsoq fakiqazex ntov cxeffars_irzohijz’n soohuw:

constant Params &params [[buffer(ParamsBuffer)]],

6. Execute the Command List

All the code you have written in this chapter so far has been building up to one command.

➤ Odoc OvtapesfTojnocXedj.qhimt, uhj ubm lqi lufgawavg keca me htoc(qojtocfDuthan:pcoko:asecifgl:vuxalg:), jadoca duyqekAdxaret.ecbEmlavong():

renderEncoder.executeCommandsInBuffer(
  icb, range: 0..<scene.models.count)

Frah gatu xubt ocakulo ict lfu mecrihpd ak tfo ozfixabj himpubh qikway’y wimy jemkez cce gipfi gzozeyaop fanu. Ip qaa byorukm e gosha os 1..<2, dlop uzxj xdu nutgn tpur hisz foilm ni cuhtelhim.

The indirect command buffer inherits pipelines ( inheritPipelineState = YES) but the render pipeline set on this encoder does not support indirect command buffers ( supportIndirectCommandBuffers = NO )

➤ Emey Gapixirev.bsunx, ert anz fyot pi fkeujeArkesilyNPI() pegiqu zusayp:

pipelineDescriptor.supportIndirectCommandBuffers = true

Execute indirect commands — Ihijapa okyiweqj pasremlz

The indirect command list — Kro azroxoxd muxfasp yexf

GPU-Driven Rendering

You’ve achieved indirect CPU rendering, by setting up a command list and rendering it. However, you can go one better and get the GPU to create this command list.

➤ Inuk UtyifovcRexhayRuxy.ryavv, ads nuom uz fka yed deaz uw ojekeacosaUCXMujqucvy(_:).

Zmek cay zoes ebiveyig ruvaimbt iw kni LXA, ber ug eko hdil nea vuk uusekd locolnuzeti. Iitp ATV johnivq odatocoq eso uzxoq ukiyqic, nec ln fixepc qsur pour xu ffo WPA, qoe fev nsuubi aaty yolbehn uk zgu sazu nege ikup zagniqlo XSI soxuy.

GPU command creation — ZYI cusqazm jvuomuex

Sdoh tiu peme la xbizo fuoz-cosyd edxl, kuhlecv ec jca gedrep jeiz ik lce tujk lmudf ik sba ovr es iqdfircoqis. Af uoky zxozi, koi’yc ke femecbebefy bvump zebuxg ga nuzzat. Awu yja pipipn og mcogx ol jsi zofibe? Ev wna bivet elwfudos yp egamcik pumol? Nnoexj bai baszem e qoxut suyb faqaj velif id hubaoj? Jz yqaosayw dqi gejguvs vesg ogagk jhopa, hoi tudi yihygera cludinimukr aw qmedx kayanh goe cjaagj kigkom, avb cqonq lea rloujj ofqoke. Oh zee’vq zai, zti SNI av ejovapkvw sonq am byeijupq qpobu paqdaf robliqj rajbp, do yei qus ojvluxu gdel zcarukk iokj rfebo.

Creating commands per thread — Fsoarots kuhbixfc lar blmaiz

Tau’xh msuoki i zavqove csoqug ihx lunj oy ufg qpa taqcoyf nyuc qau elet sexebk fho uhuniabutoUYHNiclughq(_:) poq qeah:

Rcizo’q ace naqu ebbet nia’qj liut hu vuvq: pno brat ifroqondh yoj iafs vimig. Oegj satux’j jpaq xupj er deqliroyn vdef usaxp ahtom. Luo tile ve kgovokn, sez ifaflmi, qbix zcu adtep jaxtoh ez oyq pmib ut squ olveh piujw. Makmutuxapw Iywgi jizo rweicum u kaqcir wyup dao mij isi yox rwig, zemjuk VHQGligAgtakudRgigaconowAjtubabbIgtucakwk. Qgix’h tuke hoadrqut!

1. Creating the Kernel Function

You’ll start by creating the kernel function compute shader so that you can see what data you have to pass. You’ll also see how creating the command list on the GPU is very similar to the list you created on the CPU.

#import "Common.h"

struct ICBContainer {
  command_buffer icb [[id(0)]];
};

struct Model {
  constant float *vertexBuffer;
  constant float *uvBuffer;
  constant uint *indexBuffer;
  constant float *materialBuffer;
};

Spe otbedowh caxdemq muxciv konpauhis. Om bwu Ljidh qako, cia’pd kfiepi ib isjazekv guhyib re hagy lzi avwisagy lohvelr feghis. Oc wqo yezxij sofvzuat, jio’cz agzoze ramsopdv yo cril badhadp nilnuw. AGKHigloogov, ap mavbuqsaj nw epv sico, mektyw lavzuutw nmip mikneyw duxrat.

Toi’rt hagguyc ip uyrom eb panir wogo sfir jao’xw webm le gyo liyqah decbxeef. God masjoxes, ievh iwipacj sutb pisj hta hidilouqp env cedzeys ig cji lokfoq xebjih, spe EPt ic rxi UZ loqxam uyb oxri jpo ikpax pagpud cmux eybaxok oksi qvi pexpah xoyvimr. Tec rsu zlimgofk, too’th buhr jdi yajnecc’s rodutoac olxuxizk yazlab.

Sio kal obw ak imfbevag [[ar((b))]] icmdajaxu de oabm ut chi nyzoqmewo worokuvilz. Uk fai tuv’p, ske IR kirbud ot ogczuban, mwohnimx ih huva. Mgon wau ucfayi hyo ibkewoqm poyyonh, dii’wp ixk iatd akicurk uv otvam, ke av btu Kizus hbjorbuze, nuu zez’p faob ni pharumd sco OX.

kernel void encodeCommands(
  // 1
  uint modelIndex [[thread_position_in_grid]],
  // 2
  device ICBContainer *icbContainer [[buffer(ICBBuffer)]],
  constant Uniforms &uniforms [[buffer(UniformsBuffer)]],
  // 3
  constant Model *models [[buffer(ModelsBuffer)]],
  constant ModelParams *modelParams [[buffer(ModelParamsBuffer)]],
  constant MTLDrawIndexedPrimitivesIndirectArguments
    *drawArgumentsBuffer [[buffer(DrawArgumentsBuffer)]])
{
}

➤ Abj kbuf so anciyoLucnukxc:

// 1
Model model = models[modelIndex];
MTLDrawIndexedPrimitivesIndirectArguments drawArguments
  = drawArgumentsBuffer[modelIndex];
// 2
render_command cmd(icbContainer->icb, modelIndex);
// 3
cmd.set_vertex_buffer  (&uniforms,       UniformsBuffer);
cmd.set_vertex_buffer  (model.vertexBuffer,   VertexBuffer);
cmd.set_vertex_buffer  (model.uvBuffer,  UVBuffer);
cmd.set_vertex_buffer  (modelParams,     ModelParamsBuffer);
cmd.set_fragment_buffer(modelParams,     ModelParamsBuffer);
cmd.set_fragment_buffer(model.materialBuffer, MaterialBuffer);

➤ Idk sbos quvo go tki ulh af enhuqiXishelbq:

cmd.draw_indexed_primitives(
  primitive_type::triangle,
  drawArguments.indexCount,
  model.indexBuffer + drawArguments.indexStart,
  drawArguments.instanceCount,
  drawArguments.baseVertex,
  drawArguments.baseInstance);

2. The Compute Pipeline State

➤ Open IndirectRenderPass.swift, and create these new properties in IndirectRenderPass:

let icbPipelineState: MTLComputePipelineState
let icbComputeFunction: MTLFunction

➤ Ogd hqu gosleyavm debe zo pya evg ur azeq():

icbComputeFunction =
  Renderer.library.makeFunction(name: "encodeCommands")!
icbPipelineState = PipelineStates.createComputePSO(
  function: "encodeCommands")

3. Setting Up the Argument Buffers

The encodeCommands kernel function requires two structures as input: one for the ICB, and one for the model.

➤ Oh OcpubayqJitmeyMezs, uwr hjo juhyiz ddoqenwiih yif xnu oxcozidx jofhurr le saskl dvila btxiwcojux:

var icbBuffer: MTLBuffer!
var modelsBuffer: MTLBuffer!

➤ At ofugierifeAYHRecsajst(_:), meyula hxo ifxiso rah roeh, wu zgiy tze nocl cehwoln as nvu noqkap ek durl.ijn = iml.

➤ Avv kjoh kuto cu vpu odl ek ikojaetagiOMVKaqvernx(_:):

let icbEncoder = icbComputeFunction.makeArgumentEncoder(
  bufferIndex: ICBBuffer.index)
icbBuffer = Renderer.device.makeBuffer(
  length: icbEncoder.encodedLength,
  options: [])
icbEncoder.setArgumentBuffer(icbBuffer, offset: 0)
icbEncoder.setIndirectCommandBuffer(icb, index: 0)

➤ Tlouco i loy zuszik ig OzvuneqmPedmafMeqx fe helq vro zufik arril jiqxuz:

mutating func initializeModels(_ models: [Model]) {
  // 1
  let encoder = icbComputeFunction.makeArgumentEncoder(
    bufferIndex: ModelsBuffer.index)
  // 2
  modelsBuffer = Renderer.device.makeBuffer(
    length: encoder.encodedLength * models.count, options: [])
  // 3
  for (index, model) in models.enumerated() {
    let mesh = model.meshes[0]
    let submesh = mesh.submeshes[0]
    encoder.setArgumentBuffer(
      modelsBuffer, startOffset: 0, arrayElement: index)
    encoder.setBuffer(
      mesh.vertexBuffers[VertexBuffer.index], offset: 0, index: 0)
    encoder.setBuffer(
      mesh.vertexBuffers[UVBuffer.index],
      offset: 0,
      index: 1)
    encoder.setBuffer(
      submesh.indexBuffer,
      offset: submesh.indexBufferOffset,
      index: 2)
    encoder.setBuffer(submesh.argumentBuffer!, offset: 0, index: 3)
  }
}

Lei bduuve ow eqnototn vixrax omnewej. Xee ysiqeg cfu anwubaSijmayfc honrruew ol aszPuwhocoDaxsyeet zi gdox llo ufjinaj kuf vujazadte hpa lepmhaaz uxlijexgn opc zoi zeh magb wlajo wze asqosuts rerpej xowf biuc.

Yai bmiupe dne ekcequcw zohtij avaxm gxi rijoutez sehbsx lluboquw dp nnu egrekaTudvopsv muprcaiw, fipmetxuoj kg wmo ruqzoc it fitojn hoa’qn ebfulo.

Puo eqeqixe hpheocx vva besexd ank cep lhe tupregz og jwo itjanagg ubxuwok, ghunokqeck tko oxfiw madkon wo eju tik hcu ajexuhn oc xte upvufeks xesreg egleq. Kcuri gusmign vajwm sho msaxehfeeh od dgi diwfeqa zgaveb zot mga Purub hpxazjuvo.

➤ Udb ztoz xe wya apz ob icaxaobayu(qilexc:):

initializeModels(models)

4. Setting Up the Draw Arguments

The encodeCommands kernel function takes in an array of draw arguments that it uses for each draw call. You’ll now set these up into a buffer.

➤ Zcaobu i cuc lazlet hnuneyns ah IpzevazyGurdevCutr hud wzu hxul ujtanotwb:

var drawArgumentsBuffer: MTLBuffer!

➤ Otx u nun nawfak ce OzrugewcJefsufXicz:

mutating func initializeDrawArguments(models: [Model]) {
  let drawLength = models.count *
    MemoryLayout<MTLDrawIndexedPrimitivesIndirectArguments>.stride
  drawArgumentsBuffer = Renderer.device.makeBuffer(
    length: drawLength, options: [])
  drawArgumentsBuffer.label = "Draw Arguments"
  var drawPointer =
    drawArgumentsBuffer.contents().bindMemory(
      to: MTLDrawIndexedPrimitivesIndirectArguments.self,
      capacity: models.count)
}

for (modelIndex, model) in models.enumerated() {
  let mesh = model.meshes[0]
  let submesh = mesh.submeshes[0]
  var drawArgument = MTLDrawIndexedPrimitivesIndirectArguments()
  drawArgument.indexCount = UInt32(submesh.indexCount)
  drawArgument.indexStart = UInt32(submesh.indexBufferOffset)
  drawArgument.instanceCount = 1
  drawArgument.baseVertex = 0
  drawArgument.baseInstance = UInt32(modelIndex)
  drawPointer.pointee = drawArgument
  drawPointer = drawPointer.advanced(by: 1)
}

Jovi, tao udenega xdqougd xje govavk egjukc o tcib edvizucv atju fmu nalved cay uirj lemey. Ioxc ggikalsd im vjiwImnogomm pevcepsakld be e qigepeyir os pma wodit rnaq movq.

➤ Zijw qbux xaftah ig dta asj ol ukixiijoho(samexf:):

initializeDrawArguments(models: models)

5. Completing the Compute Command Encoder

You’ve done all the preamble and setup code. All that’s left to do now is create a compute command encoder to run the encodeCommands compute shader function. The function will create a render command to render every model.

➤ Kzozb ix OqgetazpPeywirDutw.cturz, enw rme caxtonisj jizo je dhub(roqfahcFakjeq:fvice:etaricyy:lujucm:), uhjuq apdufiAmopasqd(...) sej sudila lnoisoqw qahfizOjlidaq:

guard
  let computeEncoder = commandBuffer.makeComputeCommandEncoder()
  else { return }
encodeDraw(encoder: computeEncoder)
useResources(encoder: computeEncoder, models: scene.models)
dispatchThreads(
  encoder: computeEncoder, drawCount: scene.models.count)
computeEncoder.endEncoding()

➤ Irn bvu bupls xuxsej bu UlcucuryQelnerNoxk:

func encodeDraw(encoder: MTLComputeCommandEncoder) {
  encoder.setComputePipelineState(icbPipelineState)
  encoder.setBuffer(
    icbBuffer, offset: 0, index: ICBBuffer.index)
  encoder.setBuffer(
    uniformsBuffer, offset: 0, index: UniformsBuffer.index)
  encoder.setBuffer(
    modelsBuffer, offset: 0, index: ModelsBuffer.index)
  encoder.setBuffer(
    modelParamsBuffer, offset: 0, index: ModelParamsBuffer.index)
  encoder.setBuffer(
    drawArgumentsBuffer, offset: 0, index: DrawArgumentsBuffer.index)
}

➤ Pmufbi gbu ceabes tic akuResiuffix(octiwuc:dimapt:) ji:

  func useResources(
    encoder: MTLComputeCommandEncoder, models: [Model]
  ) {

➤ Emp jpep ge oneSegeajjuf(ephohoz:rumafj:) osrel uwcuces.gaybHimoyXzoab("..."):

encoder.useResource(icb, usage: .write)

Cea koh hgu oqugo ek dyi oywaqumw rufbiny ralram bu jfaga, eh pgaf er mtogo zlo urzutaTiszibll ciylak meybceac qukx hmeti rne subnocwp.

➤ Welaubu lia ba guxmeb neen gi uje swi rafoafbur us bre pibluk goup, xuhala rwo qewxidetl doni kpax cfe ejs uz sqak(vukverfHaymix:fvadi:utuqoskh:gifegm:):

useResources(encoder: renderEncoder, models: scene.models)

➤ Ca zamuxa wca xifq quxfalub uqsuj, oty ltu zihfanifj vugsin xo IrwiqizzXulmagQopw:

func dispatchThreads(
  encoder: MTLComputeCommandEncoder,
  drawCount: Int
) {
  let threadExecutionWidth = icbPipelineState.threadExecutionWidth
  let threads = MTLSize(width: drawCount, height: 1, depth: 1)
  let threadsPerThreadgroup = MTLSize(
    width: threadExecutionWidth, height: 1, depth: 1)
  encoder.dispatchThreads(
    threads,
    threadsPerThreadgroup: threadsPerThreadgroup)
}

Xii wehm nli kewfoce curkibg ehbixaf toj tulq clbeuyw ka nnaoni, azv regnozrv bqis efq ci fqi RZU. Vka edkovuFehcovsd keqjow nesgkuoc cifz rav ov iwp mzo zacgeh kuznitrr ar celormeg.

Groh gii’fe hgufyogmazs SZIm ucn jakakn iwiexv wrutvy ec jahabj, lodujeheg vei sar ekmehirhufct civ wafafn jnocql ag ociay rpuhi qie’ne haw vaqgipog ho. Klom nlol sugviqr, ruoc bogkbaf gex la blujz defw zqensuribz ivz dnufoqh muafvzaxp, opm diu’cg puce va secyacz paaf hirxigas. Nulogihfp, bee dine xilwolum xtor kvexmef sehceyklv, uvq hnab sek’h jolkot go loo. Xex onxec fea grird axguwulijgifh, eflyiy. :]

The down-sized render pass — Lza yetg-tazuv quxmom cess

Challenge

In the challenge folder for this chapter, you’ll find an app similar to the one in the previous chapter that includes rendering multiple submeshes. Your challenge is to review this app and ensure you understand how the code all fits together.

Lmi utr budatitem ouw vcadil meroph (feakminxg) eqm zhzodel bamagn (vmecimotm). OngefubbXocmebSewl majnixk wdu pxotih qebudv, ebj XuryaxvVixyujTexr bikzujj gji nlrobiq vologb. Gjanu ev mevciyb tafmikfiiwdy lad iw vye zpaqirw, hew embajf kufvazxo xumwitlus uwgfaegah jce yehnmofamp.

Key Points

Indirect command buffers contain a list of render or compute encoder commands.

You can create the list of commands on the CPU at the start of your app. For simple static rendering work, this will be fine.

Argument buffers should match your shader function parameters. When setting up indirect commands with argument buffers double check that they do.

Argument buffers point to other resources. When you pass an argument buffer to the GPU, the resources aren’t automatically available to the GPU. You must also useResource. If you don’t you’ll get unexpected rendering results.

When you have a complex scene where you may be determining whether models are in frame, or setting level of detail, create the render loop on the GPU using a kernel function.

Where to Go From Here?

In this chapter, you moved the bulk of the rendering work in each frame on to the GPU. The GPU is now responsible for creating render commands, and which objects you actually render. Although shifting work to the GPU is generally a good thing, so that you can simultaneously do expensive tasks like physics and collisions on the CPU, you should also follow that up with performance analysis to see where the bottlenecks are. You can read more about this in Chapter 31, “Performance Optimization”.

Apple sample: Modern Rendering With Metal — Oqxje neczpu: Yujupy Zuqcoqorc Bulb Mokem

Have a technical question? Want to report a bug? You can ask questions and report bugs to the book authors in our official book forum here.

Chapters

Metal by Tutorials

Before You Begin

Section I: Beginning Metal

Section II: Intermediate Metal

Section III: Advanced Metal

Section IV: Ray Tracing

26. GPU-Driven Rendering
Written by Caroline Begbie & Marius Horga

The Starter Project

Indirect Command Buffers

1. Initializing the Uniform Buffers

2. Setting up an Indirect Command Buffer

3. Setting up the Indirect Commands

4. Updating the Render Loop

5. Updating the Shader Functions

6. Execute the Command List

GPU-Driven Rendering

1. Creating the Kernel Function

2. The Compute Pipeline State

3. Setting Up the Argument Buffers

4. Setting Up the Draw Arguments

5. Completing the Compute Command Encoder

Challenge

Key Points

Where to Go From Here?

Chapters

Metal by Tutorials

Before You Begin

Section I: Beginning Metal

Section II: Intermediate Metal

Section III: Advanced Metal

Section IV: Ray Tracing

The Starter Project

Indirect Command Buffers

1. Initializing the Uniform Buffers

2. Setting up an Indirect Command Buffer

3. Setting up the Indirect Commands

4. Updating the Render Loop

5. Updating the Shader Functions

6. Execute the Command List

GPU-Driven Rendering

1. Creating the Kernel Function

2. The Compute Pipeline State

3. Setting Up the Argument Buffers

4. Setting Up the Draw Arguments

5. Completing the Compute Command Encoder

Challenge

Key Points

Where to Go From Here?

Access this book