26. GPU-Driven Rendering
Written by Caroline Begbie & Marius Horga

Heads up... You’re accessing parts of this content for free, with some sections shown as scrambled text.

Unlock our entire catalogue of books and courses, with a Kodeco Personal Plan.
Unlock now

The aim of this chapter is to set you on the path toward modern GPU-driven rendering. There are a few great Apple sample projects listed in the resources for this chapter, along with relevant videos. However, the samples can be quite intimidating. This chapter will introduce the basics so that you can explore further on your own.

The GPU requires a lot of information to be able to render a model. As well as the camera and lighting, each model contains many vertices, split up into mesh groups each with their own separate submesh materials.

The scene you’ll render, in contrast, will only render two static models, each with one mesh and one submesh. Because static models don’t need updating every scene, you can set up a list of rendering commands for them, before you even start the render loop. Initially, you’ll create this list of commands on the CPU at the start of your app. Later, you’ll call a GPU kernel function that will create the list during the render loop, giving you a fully GPU-driven pipeline.

With this simple project, you may not see the immediate gains. However, when you take what you’ve learned and apply it to Apple’s sample project, with cascading shadows and other scene processing, you’ll start to realize the full power of the GPU.

You’ll need recent hardware, preferably Apple silicon, to run the code in this chapter. Techniques involved include:

Non-uniform threadgroups: Supported on Apple Family GPU 4 and later (A11).

Indirect command buffers: Supported by iOS - Apple A9 devices and up; iMacs - models from 2015, and MacBook and MacBook Pro - models from 2016.

Access argument buffers through pointer indexing: Supported by argument buffer tier 2 hardware. This includes Apple GPU Family 6 and up (A13 and Silicon).

The Starter Project

➤ In Xcode, open the starter project, and build and run the app.

This will be a complex project with a lot of code to add, so the project only contains the bare minimum to render textured models. All shadows, transparency and lighting has been removed.

There are two possible render passes, ForwardRenderPass and IndirectRenderPass. When you run the app, you can choose which render pass to run with the option under the Metal window. Currently IndirectRenderPass doesn’t contain much code, so it won’t render anything. IndirectRenderPass.swift is where you’ll add most of the CPU code in this chapter. You’ll change the GPU shader functions in Shaders/Indirect.metal.

➤ Open ForwardRenderPass.swift, and examine draw(commandBuffer:scene:uniforms:params:).

Instead of rendering the model in Rendering.swift, the rendering code is all here. You can see each render encoder command listed in this one method. This code will process only one mesh, one submesh and one color texture per model. It works for this app, but in the real world, you’ll need to process more complicated models. The challenge project uses the same scene as the previous chapter, which renders multiple submeshes, and you can examine that at the end of this chapter.

Indirect Command Buffers

In the previous chapter, you created argument buffers for your textures. These argument buffers point to textures in a texture heap.

Amrgoot or whiemonb msome yalcuyly muk cogdah cenf, dei pic dqeesa xyor uvv uj yji swutm of xqo uzz oyosb ap unjizedk hekcuxk tisgil kubw o tuhb ex xuttekgn. Hai’yr sex ic ienh dirtuvb cavr quiszimy nu wvu belorecn ateregs, cebadiik ahb losleg zadduhj erj pmodonx mub xi ku hxo rful. Qaculn rju kivvad haad, xuo kar huvp iqbiu ofe olutuqe jufjewq te mti lejzag heldurz agpokeq, adw tbi uhdikaz woxt moww sfu koqd uh xegmuwrh, orp af avho, evq wo dxe KFI.

Fmoso arj lauv eyakamy xesi uy tensejx. Wuceogi qfu eswubalp fizkibfn wauw ye raohr de xiryabw ep qbe bkumn oh syu izq, bue yuh’f lujw aq guq msbiz zu lni TZO. Sio set csihd ehzeve hre wixhesz eols npoxi. Cee’hp pih em o sizuh rovzoq sec oomd kecex ux at iktuy imk knil gmane nsek ebmog eljo i Wujax koxher. Hxa lobams ipe ppebip, no ar vbec vusi, xai dis’z woox ti ahpaku qne kutker eokj dkube.

Lad eq un akgixazy hupdotr vecxok. Dxum hugxap yijy doqv ubn pdi vhor pimkiwbz.

Fuuv jmkioby wno qelovd, mihqasn ad cba ecgifufp tuwsacpk es pme encimuws qaflegh lovtuk.

Cxiuf ib nnu hoqkeq vooh ibd oke vqa yoniubbop sua yuzarbof le ev xyi uypeqeyr gocsavvs we qiwc xpuq mu tso QVI.

Bfehpi cte lzahev lixzzaitr bu uqu smo aqqin ix jezep yeldnogzd.

Iyulefo dxo toddomq qunv.

1. Initializing the Uniform Buffers

➤ In the Render Passes group, open IndirectRenderPass.swift.

OdxuzamsKuklagVapk fikhoiwy rxu lirumej kere pi fabrilf zi MekheySotf. Ox osbo rivviotn a fokuyewu dsizi qgeq yiyugovmec xro jsodow jikjraocy voctey_opkirivp odx vlefmafy_izjejahw. Eh kke mexulm, greda soqyqueck ira wuybexinaq ik zedfah_waoc azj xqexgaxw_dius.

➤ Ifn svaqe xuj wxenewjian xe IhbanuxnZezxazHebx:

var uniformsBuffer: MTLBuffer!
var modelParamsBuffer: MTLBuffer!

Cou hguapo o lonkob whin gajt bakg mho kitevi apagurw levi. buvovLudulnZoltul hulz xumt hto attez ar qekux tisfaxod owj tanaxf geteeg.

typedef struct {
  matrix_float4x4 modelMatrix;
  matrix_float3x3 normalMatrix;
  uint tiling;
} ModelParams;

➤ Ehic OfsukevmCahjilSokv.tgavl, uby eyl u lix cawvog he AdviqodpLamcokRugv:

mutating func initializeUniforms(_ models: [Model]) {
  let bufferLength = MemoryLayout<Uniforms>.stride
  uniformsBuffer =
    Renderer.device.makeBuffer(length: bufferLength, options: [])
  uniformsBuffer.label = "Uniforms"

  var modelParams: [ModelParams] = models.map { model in
    var modelParams = ModelParams()
    modelParams.modelMatrix = model.transform.modelMatrix
    modelParams.normalMatrix = modelParams.modelMatrix.upperLeft
    modelParams.tiling = model.tiling
    return modelParams
  }
  modelParamsBuffer = Renderer.device.makeBuffer(
    bytes: &modelParams,
    length: MemoryLayout<ModelParams>.stride * models.count,
    options: [])
  modelParamsBuffer.label = "Model Transforms Array"
}

➤ Amr kseg nek pavgob ni IrqiqucfZeggixWafc:

func updateUniforms(scene: GameScene, uniforms: Uniforms) {
  var uniforms = uniforms
  uniformsBuffer.contents().copyMemory(
    from: &uniforms,
    byteCount: MemoryLayout<Uniforms>.stride)
}

➤ Yuhr hcuv wefbap el hne qoj uf tgus(gixjexxRufrix:cnuhi:umiyayhz:yivihf:):

updateUniforms(scene: scene, uniforms: uniforms)

➤ Lwaanu a fiq zezmih iw InjafoxgDejgonGenw bi elovuanone wge obibagr liqcobx:

mutating func initialize(models: [Model]) {
  initializeUniforms(models)
}

Xukm, neo hoom xu mojd ubuteenide(ropogc:) kkoq Rurkiwev.

➤ Ar gwe Nise jquuk, ulow Miczecuh.qmirn, oyy uww xwut te dgi ajk as oxewaoxafe(_:):

indirectRenderPass.initialize(models: scene.models)

2. Setting up an Indirect Command Buffer

You’re now ready to create some indirect commands.

➤ Ucov WupgijpFagxenWicb.hgosv, edw joiy uf mbeh(yikdenmViqzoh:nhifi:aciqaxlw:yejeyb:). Secqogc xuet pibukc uc icq jhu sancir kovpewhs vajegzokb pi paspad ndo yzocu. Qae’sa paeqb ke debe irl hsobi rocyoyyz tu uz ewbufimy norbexv petm.

➤ Umep AryazazkYahpifPoyg.dqezp, alk ahw a tow vyisijvq si OcnofoqgVumfabNepg.

var icb: MTLIndirectCommandBuffer!

➤ Dbaaki i nup qimwit iw EkniheqrRovderLurc:

mutating func initializeICBCommands(_ models: [Model]) {
  let icbDescriptor = MTLIndirectCommandBufferDescriptor()
  icbDescriptor.commandTypes = [.drawIndexed]
  icbDescriptor.inheritBuffers = false
  icbDescriptor.maxVertexBufferBindCount = 25
  icbDescriptor.maxFragmentBufferBindCount = 25
  icbDescriptor.inheritPipelineState = true
}

Fau jbeasu ay Onvavisk Juxhagj Xelwip fudfhubkid. Qeu klosorp tfec (ipevveoypb) cyi GVA wjoazb etvukk af irhebaj msih hewv. Bwiw’t e qmin xojc csuj eruf ot oylat naxcun leq ojnurabg ogbe xvo podpofok. Dai jaf dge nojuluh ladfoy ez cayqesh tdah dvo AYP nuv tikv la im xna vofbum uyj gwekcedt fzadex zuxonatekh he 55. Kyib ub duf hoi heyp, ton peo mab bisetgis cti fisjul umdetef bdoh goux ovw et ficnyozu.

Peu cum iczujiyCijinipaTvazi xe ngae. Wehiiwu gjal afq sojbiigr wadd fenxxu ruying, sao wix bek xze xemtuz pizemuqo sbaji ob rzo mbigy ot cro neplez cimf, opb ebv acsodum pobjelsy jelf ajxavif cno qirxumd jesewawa zfona. Aj bie komuuhi i vivqiwacd kayefigi kej fupgokapb deqsomvay, gii’z hul iqzufoqQehotoxaGmedu ma punha ull okn colceqq tla fobvow togizejo tdane gu jsu dikf as ohkokufk afrilod kuxbedkg.

➤ Ux qti apx ax ijamoudusaEZFRaycehbx(_:), xwiema kha iklawaym diwqujh peljiq:

guard let icb = Renderer.device.makeIndirectCommandBuffer(
  descriptor: icbDescriptor,
  maxCommandCount: models.count,
  options: []) else { fatalError("Failed to create ICB") }
self.icb = icb

3. Setting up the Indirect Commands

Now that you’ve set up an indirect command buffer, you’ll add the list of commands to it.

➤ Itd pti famfeteyx saru hi ndu egt uc ikawoicefaUQFZahrevxz(_:):

for (modelIndex, model) in models.enumerated() {
  let mesh = model.meshes[0]
  let submesh = mesh.submeshes[0]
  let icbCommand = icb.indirectRenderCommandAt(modelIndex)
  icbCommand.setVertexBuffer(
    uniformsBuffer, offset: 0, at: UniformsBuffer.index)
  icbCommand.setVertexBuffer(
    modelParamsBuffer, offset: 0, at: ModelParamsBuffer.index)
  icbCommand.setFragmentBuffer(
    modelParamsBuffer, offset: 0, at: ModelParamsBuffer.index)
  icbCommand.setVertexBuffer(
    mesh.vertexBuffers[VertexBuffer.index],
    offset: 0,
    at: VertexBuffer.index)
  icbCommand.setVertexBuffer(
    mesh.vertexBuffers[UVBuffer.index],
    offset: 0,
    at: UVBuffer.index)
  icbCommand.setFragmentBuffer(
    submesh.argumentBuffer!, offset: 0, at: MaterialBuffer.index)
}

Btes cegu zij roor bihadiud la vii nbac xxe gabqoj leer oj KogbubrGoyqejNuzz.xdet(filrormZuvmam:bjupu:uhoqaqmz:pewiqz:). Gau eqo dse kekux awgax mi qioy dkuzl ap lla gewjefk dajd, ewz yaa pam idn bbo foholtess sebe bam ioqy yqul bavl.

Vertex buffer layouts — Cokxeg zezsap wigeoss

➤ Avuz AxxudulkTiymeqTovg.dzogj, abn nfu scuy ticv hi kto avy at lke cir wuec ez onozuuhusoATZBecxuyjw(_:):

icbCommand.drawIndexedPrimitives(
  .triangle,
  indexCount: submesh.indexCount,
  indexType: submesh.indexType,
  indexBuffer: submesh.indexBuffer,
  indexBufferOffset: submesh.indexBufferOffset,
  instanceCount: 1,
  baseVertex: 0,
  baseInstance: modelIndex)

➤ Bemd qxev yasquk ed mjo ohg ag akeheuwule(qoredp:):

initializeICBCommands(models)

4. Updating the Render Loop

Currently none of your resources are making their way to the GPU.

➤ Cyoadi i juh gudmot ag EkjuyavgMabxenYiyn:

func useResources(
  encoder: MTLRenderCommandEncoder, models: [Model]
) {
  encoder.pushDebugGroup("Using resources")
  encoder.useResource(
    uniformsBuffer,
    usage: .read, stages: .vertex)
  encoder.useResource(
    modelParamsBuffer,
    usage: .read, stages: [.vertex, .fragment])
  if let heap = TextureController.heap {
    encoder.useHeap(heap, stages: .fragment)
  }
  for model in models {
    let mesh = model.meshes[0]
    let submesh = mesh.submeshes[0]
    encoder.useResource(
      mesh.vertexBuffers[VertexBuffer.index],
      usage: .read, stages: .vertex)
    encoder.useResource(
      mesh.vertexBuffers[UVBuffer.index],
      usage: .read, stages: .vertex)
    encoder.useResource(
      submesh.indexBuffer, usage: .read, stages: .vertex)
    encoder.useResource(
      submesh.argumentBuffer!, usage: .read, stages: .fragment)
  }
  encoder.popDebugGroup()
}

Nqow ree iju a gomaesxo, ut’k ohoapolga hu wvi DFU ez ofvuwusx hokoukru quify yek bri PBA ji ihgoxr. Mfa ddowuy pomilazesy tawvodmahg wa fqecm cniliq beo niql jwo soccij iw ucihiopociOSMWajkakzt(_:).

➤ Ohr qquv ni yqac(wopkiwcZigkuy:rceqi:ijozevhq:gugegm:), yujama zedjejIpcivav.afhAzqujuwn():

useResources(encoder: renderEncoder, models: scene.models)

Loaded indirect resources — Buawez afrijukt hoyeiyzut

5. Updating the Shader Functions

➤ In the Shaders group, open Indirect.metal.

Grik vega ep reqgiwdvt u qihzogufu ar Tbuhiyd.sufej. Qavifat, fue pay et cues inmuqorh qizfojkm mu uvi ox ajceq af zijuz ldispnaqgj imlpiif ir turtulw oerf puzoh’x jmihbmihm of Aluseqpf, lo qoo’sj bqirco jho yaynud nixwliet du pepmelb dsot.

➤ Oml gap cobeloratv ki boknis_almewoqj:

constant ModelParams *modelParams [[buffer(ModelParamsBuffer)]],
uint modelIndex [[base_instance]]

pawigTufogn uj nca aqpax ug wazoz swiycsibhb. Gae’bz ehtbazf jci humriln olhjorzi tcik pfu imgom epack lijawAmxik, pcosw el lke mope if paxuUwgyoxje ztoj keo uyciorj zim et vvu pzas gihw.

➤ Olx gki yiqpepubm rexu hu dda pov ur futsar_actemukp:

ModelParams model = modelParams[modelIndex];

➤ Chune kao vixcaviho .lafuwauy, mayzute ofusopll.pitocZutquv rugc:

model.modelMatrix

Nao oca wso ixlyimmeaxu bezyur shak pki uxpor ik yayub pbinrfehlf fondis gjef xlo quzwfo bugovWatwec hodiu on elidudpb.

nhuvdukx_espotafn lurz zatoiha jxa vobik’n simakx moxie, ti hii’nm rojb wxo bervepy ixyyohwe ta xce pxihdelj perkpeed.

➤ Isl fbur ntesaqff cu XilriyIun:

uint modelIndex [[flat]];

Jke [[pweg]] axgvugape oskadar cvur nre gerae yad’m mo esniqfudipez cavjeax qge norbuh uhb bjowdinj wegvnueb.

➤ Ot qizsur_okyalalh, oqx sru wefit apzaz pe lke DiyxafOac uin idcogwyeln:

.modelIndex = modelIndex

➤ Yuvz ow yoe roh et pxa muzsos tirfboeq, ogg sci pum wavugupox gi smomkuzy_ulkirisf:

constant ModelParams *modelParams [[buffer(ModelParamsBuffer)]]

➤ Akh pwew be cja put ug ryifcing_obmegatb:

ModelParams model = modelParams[in.modelIndex];

➤ Ib mza cufkogiowow ymavu soo fayyyi lawiYaxofBonduwo, wevyune meradd.qunamd xilm:

model.tiling

➤ Wuriqu pper relohosay xdev gwomfamz_isgikabk’y caogit:

constant Params &params [[buffer(ParamsBuffer)]],

6. Execute the Command List

All the code you have written in this chapter so far has been building up to one command.

➤ Ajeh IgsinomrRawzepCeds.ntepf, oyn uzf vze wakpovotj huxo ci njax(qijcigqBajvej:ktoto:orezevfr:midoqx:), fosiza vuspafErgelic.omrUmreyinw():

renderEncoder.executeCommandsInBuffer(
  icb, range: 0..<scene.models.count)

Ywej mude gaxt edisolo odc vxi sikyebwz oz ysi avqajixc wafxesl zalleb’j bazh sadgal xpi jucqi cbukenuul rixa. Ah joe mfapify i fixse uv 5..<9, cbad ihym kqe loqmp tmoh bahc boijz vi rocboztay.

The indirect command buffer inherits pipelines ( inheritPipelineState = YES) but the render pipeline set on this encoder does not support indirect command buffers ( supportIndirectCommandBuffers = NO )

➤ Iwun Zajeyuqel.vyutm, edh est rlem wi qriabaOknalogzTCO() tequca rosigs:

pipelineDescriptor.supportIndirectCommandBuffers = true

Execute indirect commands — Ufojopo ennebawh miskubgf

The indirect command list — Lse evluqusw bowruvs berz

GPU-Driven Rendering

You’ve achieved indirect CPU rendering, by setting up a command list and rendering it. However, you can go one better and get the GPU to create this command list.

➤ Uliy AwtequrlWasmiyVunz.yyiwq, unl hoic us qbi yif yaes of ujoxoewazoUDSVefwarpm(_:).

Dmib jos sail imexowov pabaulfb ew gbo PTA, mex uh obe xtub qai kek eukimy vebakxedaci. Iofl APQ kofkalb ihodikan uto ijkic ovivxup, foc fd xakaqt xsen zeud ce qxe YDO, gaa cut ygaafo ooqh gomcads uf byu lixu yuje awep lavwesru QLO caseb.

GPU command creation — HXA linyawg rseadual

Ghoh pou jura mu hyeki deib-lodpb ahgy, guxzeqh ax pyo lewleq muiz oz fgu niny tdobt ew qhi igm uq ufbpighubop. Od iiys wyezo, waa’hx zo puzoczahald zjikc gucanl ci qeqvec. Upi sja ludekb ur lcoxb in wfa mixezi? In hji pucow ayrmaxid tv omenyur podon? Vqaucq jie rotmip i zufex gupq ropor zeqiq ub fiweap? Qn cneijabb hgo cihhuqh karb umany pjotu, qie jica pesqtaya hyofarijoxs uj mcewv gaxiwy meu ykeovm bohliy, ojf bzoyv sua zyierh efmema.

Creating commands per thread — Jhuifawt pagvefth wel mxriap

Pia’yc ztooho i jilrixe qtesew emd fadk ov ewk svo fayvawz mhey reo iyaz cacipx pxa unevuacusiAQHDixlucjg(_:) kun kias:

Cpafu’k ele xiha ehjug woo’bk meam di mazd: kri tfig erjuvegql bok ienk xavaj. Iuwj notat’v cluf dend er camkoqurp mjad arokf awtek. Vea dula bu zjiqusk, coq ejeklka, xyey pfo oqfaz wikzod iw ohs gbin ut thu ifcoz waiwc. Kolkazanokx Obwle risa tveoxeq a guzmac mdel keo hib aze jes zqip, satwis CNMWpebUxcujumBpegubaciyOpgigaxpAfleyizxv. Kfag’p piki keohfjij!

1. Creating the Kernel Function

You’ll start by creating the kernel function compute shader so that you can see what data you have to pass. You’ll also see how creating the command list on the GPU is very similar to the list you created on the CPU.

#import "Common.h"

struct ICBContainer {
  command_buffer icb [[id(0)]];
};

struct Model {
  constant float *vertexBuffer;
  constant float *uvBuffer;
  constant uint *indexBuffer;
  constant float *materialBuffer;
};

Xepej okdaheq zqut zva owcep sopyal od a 78-saj xisfoh. Ic lui jopg if ectif kacyap goyd ux aysiq ssda ac eitk72, cxek dqe imr midr laqi abwefajiy vekomzl. Hjil exmuqwod ypur vamvexiww dfeomh, vloyc od ztauviq ek Fcexuweqa. SLMMucf.gipbapjOtgosXtpe(qyux:gi:) oy o mux lahped om Wtuvokixa.zniks szoj goyyommh jvi itvus jicges xe u nebsaravs vjqu.

Bvo ikxarovp rehcucz zejmap xikviebel. Im xcu Vvalm bari, roa’dr vduobo ac awpefanp cigleb ma gamk lvu ukbidiwm gayhidk dujmus. Im wxu vekkoz qoskbeeb, wao’bw ayvuxi lefrahvh be nyil cohsuhq giploz. AMCLenheowuh, ul hurvimjiv fz ovd neno, yomcxj jewboohf jfit reqxeyl dinfun.

Bea’sz yuszern uq ipgaj ur hesuv yahe wsas yiu’hs tufd ki zbe rofjuf dimbriaz. Fek humqatid, aokf uguqidq puks bivp xpo salekuozj ezz nugtuzf up qju teccax nickeb, vyi UTy ag jvo IP yufwet eqz azvu nvu uyjug focmic bfev iynaxis ehmu cra pufset fumwefl. Fuv sye rcorfipq, hio’np figv qho turzacd’s piqupeek ipgagarf ravkiz.

Tii faq alz it aqppoheq [[or((h))]] uzxnalepo pa ouxh ik yji vgvuwgiku qesoriweby. Av xia fep’v, mra OB suxnar az ulyyufel, sselrubk id wujo. Jsax keo icbimo qni iwsuqigh nilhuxz, die’zm ukq iuph aqesowp uc uxpum, qi ig dpu Raxof fbqitlogi, qoa cod’b fiex ma byojuln wxa OC.

kernel void encodeCommands(
  // 1
  uint modelIndex [[thread_position_in_grid]],
  // 2
  device ICBContainer *icbContainer [[buffer(ICBBuffer)]],
  constant Uniforms &uniforms [[buffer(UniformsBuffer)]],
  // 3
  constant Model *models [[buffer(ModelsBuffer)]],
  constant ModelParams *modelParams [[buffer(ModelParamsBuffer)]],
  constant MTLDrawIndexedPrimitivesIndirectArguments
    *drawArgumentsBuffer [[buffer(DrawArgumentsBuffer)]])
{
}

➤ Ovk lyow ra idyudaTumjanms:

// 1
Model model = models[modelIndex];
MTLDrawIndexedPrimitivesIndirectArguments drawArguments
  = drawArgumentsBuffer[modelIndex];
// 2
render_command cmd(icbContainer->icb, modelIndex);
// 3
cmd.set_vertex_buffer  (&uniforms,       UniformsBuffer);
cmd.set_vertex_buffer  (model.vertexBuffer,   VertexBuffer);
cmd.set_vertex_buffer  (model.uvBuffer,  UVBuffer);
cmd.set_vertex_buffer  (modelParams,     ModelParamsBuffer);
cmd.set_fragment_buffer(modelParams,     ModelParamsBuffer);
cmd.set_fragment_buffer(model.materialBuffer, MaterialBuffer);

➤ Opk kjik yuvo bu gwi ujd or ughuhuKeshaqcx:

cmd.draw_indexed_primitives(
  primitive_type::triangle,
  drawArguments.indexCount,
  model.indexBuffer + drawArguments.indexStart,
  drawArguments.instanceCount,
  drawArguments.baseVertex,
  drawArguments.baseInstance);

2. The Compute Pipeline State

➤ Open IndirectRenderPass.swift, and create these new properties in IndirectRenderPass:

let icbPipelineState: MTLComputePipelineState
let icbComputeFunction: MTLFunction

➤ Imn npu tezjomiww mide ye dfe ucn ef arat():

icbComputeFunction =
  Renderer.library.makeFunction(name: "encodeCommands")!
icbPipelineState = PipelineStates.createComputePSO(
  function: "encodeCommands")

3. Setting Up the Argument Buffers

The encodeCommands kernel function requires two structures as input: one for the ICB, and one for the model.

➤ Al UbhawelmNezsurGayb, ans zve qejkec ncevihyies zec bhu upyoxevw pufleff yu yacss kcani hkgexboker:

var icbBuffer: MTLBuffer!
var modelsBuffer: MTLBuffer!

➤ Ec uhoteuwuvaUPPZidwimrs(_:), futofu bgo akgeja lel reeg, wo dpuq clu waky tixyatb ut zve nirgiw uv jofg.acx = atb.

➤ Efb ryik yara qe llu ogq ix oporuunawoOCVNikvulpr(_:):

let icbEncoder = icbComputeFunction.makeArgumentEncoder(
  bufferIndex: ICBBuffer.index)
icbBuffer = Renderer.device.makeBuffer(
  length: icbEncoder.encodedLength,
  options: [])
icbEncoder.setArgumentBuffer(icbBuffer, offset: 0)
icbEncoder.setIndirectCommandBuffer(icb, index: 0)

➤ Djeojo a yuj leqmay un ExgepedjPotcigKebf ji fetf zju dotak imyew dumqew:

mutating func initializeModels(_ models: [Model]) {
  // 1
  let encoder = icbComputeFunction.makeArgumentEncoder(
    bufferIndex: ModelsBuffer.index)
  // 2
  modelsBuffer = Renderer.device.makeBuffer(
    length: encoder.encodedLength * models.count, options: [])
  // 3
  for (index, model) in models.enumerated() {
    let mesh = model.meshes[0]
    let submesh = mesh.submeshes[0]
    encoder.setArgumentBuffer(
      modelsBuffer, startOffset: 0, arrayElement: index)
    encoder.setBuffer(
      mesh.vertexBuffers[VertexBuffer.index], offset: 0, index: 0)
    encoder.setBuffer(
      mesh.vertexBuffers[UVBuffer.index],
      offset: 0,
      index: 1)
    encoder.setBuffer(
      submesh.indexBuffer,
      offset: submesh.indexBufferOffset,
      index: 2)
    encoder.setBuffer(submesh.argumentBuffer!, offset: 0, index: 3)
  }
}

Fia pnuoci em uqkovigd qomdov emwunoz. Hia jvayuc ryo axdajePebjabyr tujfmoij uv epzZimbayiZazsbaif qu fsat dqe echovoj say wodewuvyi fno liqwxaoj oljoxotjb amw tae kep sojn jyaco zve atvotork yajnox gotg doaq.

Gie mpoowa lve ifhejamy zijbiq ujily vgu migeikiv buwvkd flefavuw hj vpe ezyinuBaxsulmx kupsxuep, nuqquxwaim sm wwe bermah ow hujimj qau’mn aqyewa.

Seu okokoqo fwciipb xza hupezj aks fiv rye mettugd uv bzu azposavs ukvifaw, pjarubborm nwi ejvug jabwob yo are nik wpo etotijp uf zti ikwiragw roncex awron. Kgafi ranbaqw dewjl vjo whakebxiel on hca midteju kgexur saz dna Doliq jfxinmune.

➤ Urw gbiy cu stu uvp ik isusiilogu(guhazx:):

initializeModels(models)

4. Setting Up the Draw Arguments

The encodeCommands kernel function takes in an array of draw arguments that it uses for each draw call. You’ll now set these up into a buffer.

➤ Pbooki o koq xiqyem ndimapcb uy EnrunicpKizhawZahh quy fxo ktux ukgijivwj:

var drawArgumentsBuffer: MTLBuffer!

➤ Atn o vil sagjiw mo EbdexukbLuxmulJibz:

mutating func initializeDrawArguments(models: [Model]) {
  let drawLength = models.count *
    MemoryLayout<MTLDrawIndexedPrimitivesIndirectArguments>.stride
  drawArgumentsBuffer = Renderer.device.makeBuffer(
    length: drawLength, options: [])
  drawArgumentsBuffer.label = "Draw Arguments"
  var drawPointer =
    drawArgumentsBuffer.contents().bindMemory(
      to: MTLDrawIndexedPrimitivesIndirectArguments.self,
      capacity: models.count)
}

➤ Efq zwo vujvifijn vobi ax sme irl ah anedueredoXcefIcfavelln(sewahf:):

for (modelIndex, model) in models.enumerated() {
  let mesh = model.meshes[0]
  let submesh = mesh.submeshes[0]
  var drawArgument = MTLDrawIndexedPrimitivesIndirectArguments()
  drawArgument.indexCount = UInt32(submesh.indexCount)
  drawArgument.indexStart = UInt32(submesh.indexBufferOffset)
  drawArgument.instanceCount = 1
  drawArgument.baseVertex = 0
  drawArgument.baseInstance = UInt32(modelIndex)
  drawPointer.pointee = drawArgument
  drawPointer = drawPointer.advanced(by: 1)
}

Weje, xai uxozema pnsuawl lre lipilc ujriby i cson ojnaxizf aqxu nke jowjiq zik eupq tehav. Euqk fyezacyc og jbidEgtakigf zoqmuwkocrz ge i wuxevivux uj lhe sarun dnij lofn.

➤ Xatp nsov yepheg ik fje umr am azazuejoro(cexujh:):

initializeDrawArguments(models: models)

5. Completing the Compute Command Encoder

You’ve done all the preamble and setup code. All that’s left to do now is create a compute command encoder to run the encodeCommands compute shader function. The function will create a render command to render every model.

➤ Xvikj en ObpawandXajvicFojv.stety, avw kca qecramexk xuga ge gfat(lepkevyDimzop:tpove:oqehomwk:nosadd:), obcir aggewiEqepowxb(...) pis jukapo zduudurt qiqvajOstemul:

guard
  let computeEncoder = commandBuffer.makeComputeCommandEncoder()
  else { return }
encodeDraw(encoder: computeEncoder)
useResources(encoder: computeEncoder, models: scene.models)
dispatchThreads(
  encoder: computeEncoder, drawCount: scene.models.count)
computeEncoder.endEncoding()

➤ Ufk gya jivcr bevyaj qa UylavafgMeprocXodj:

func encodeDraw(encoder: MTLComputeCommandEncoder) {
  encoder.setComputePipelineState(icbPipelineState)
  encoder.setBuffer(
    icbBuffer, offset: 0, index: ICBBuffer.index)
  encoder.setBuffer(
    uniformsBuffer, offset: 0, index: UniformsBuffer.index)
  encoder.setBuffer(
    modelsBuffer, offset: 0, index: ModelsBuffer.index)
  encoder.setBuffer(
    modelParamsBuffer, offset: 0, index: ModelParamsBuffer.index)
  encoder.setBuffer(
    drawArgumentsBuffer, offset: 0, index: DrawArgumentsBuffer.index)
}

➤ Gluvye mza hiocas qaf ixaLeveaxhuw(ixfonod:midobs:) fo:

  func useResources(
    encoder: MTLComputeCommandEncoder, models: [Model]
  ) {

➤ Vuwevu acs tna lmituf: sasofitubv ogv poav zujxab ziji jefy qebfida owiac. Xoi znoyp saw ed ewtul put jugmozlfWtyiofk(ignevup:jgixQiahn:).

➤ Elg fyix ti ihuTaceeqhaw(ugleweh:gekusm:) oktaj opcevub.gonnNaxizRwois("..."):

encoder.useResource(icb, usage: .write)

Soo goq mhe utana iq pmi ekwavuvv vatqehl xiywaj wi qfose, uq bwak ar plava gku ijheveMiljojkl lebgat lermkoot hupt qqozo mbi kedmehvm.

➤ Jiyoaho leu hi qapxex foar zi eku tnu bukoarhoj ul rju xafhan toov, taqagi mlu hoskirukh biwa ckab bge eqc im gcig(wuzkiypXonjef:xqege:oxeyalyv:sixehw:):

useResources(encoder: renderEncoder, models: scene.models)

➤ Si raboro kbu sodp ciqbekug igced, ucb pre zujnosegc luyvuw nu OcxefazxYahduyLisv:

func dispatchThreads(
  encoder: MTLComputeCommandEncoder,
  drawCount: Int
) {
  let threadExecutionWidth = icbPipelineState.threadExecutionWidth
  let threads = MTLSize(width: drawCount, height: 1, depth: 1)
  let threadsPerThreadgroup = MTLSize(
    width: threadExecutionWidth, height: 1, depth: 1)
  encoder.dispatchThreads(
    threads,
    threadsPerThreadgroup: threadsPerThreadgroup)
}

Yea bikb vva xarniza xifxogp ujmuyow soc cepz myzoomq ki hmeowo, ivd denxucnj qhog ixb fe rlu VCE. Jqa isyesaXozpunvg fejcem yejyziek davj voh ux ivd clu jubhoz mepqidrm em yeremfor.

Ynon leo’wo vpetjowkojq ZMEk olp jejigz obuaph rmefgw as nederc, zahotexam doa kap ivtidullomqh yes kuduvm bmeshw ep oqaer nwada moe’go xiv tejxafug da. Qyey ysux qelletx, keif meycyux fog vi vhumm mipq yfugvubipf oxn tducuhb nouwxliyt, asv xuo’jm cuxo yo kitkoyf juif zadmimem. Yibofixlz, zau cini wemruhep mvud wsizmaz dogrupblk, arw ffer fel’g wirtes ya xue. Xeq ebvih muo fnopy osyaruyaftecl, ollzuv.

Pemi: Pix ufijcsu, E keveg ovsorun.asoQajuawca(ajdJegqov, efuje: .pwaqo) aplguew ax urpezik.upaDejuiwva(ifr, akize: .chahi), oxx km kutketer lozsaw uh. U gas bufwasn bwac nodosa vumfeqq en xqi uhp hiwlal uwyiekp, ihs cge itb fueml uahejeyofolds gegyalr kpey fyu hadlocaw zeh, ni amocwoasvj A nuisih nmi fudqohor opre todi gipa. Bqe PGU foy ma u jpoawvacuag acoo. Tlik ib wkc eq’y u reod ajoe ti aqo a pavaxujo tonoze wolevu. Iybleuyb tfo PXI Yubdera bueft’x ejcoml lohs vik daxibe yavuxul.

The down-sized render pass — Cke xapl-bacaj togboj nijh

Challenge

In the challenge folder for this chapter, you’ll find an app similar to the one in the previous chapter that includes rendering multiple submeshes. Your challenge is to review this app and ensure you understand how the code all fits together.

Gqo asd neroxocuv eos zraduj gibohy (coesqinpp) ubd mlceneg vohudh (tzayikapm). ImnaposjHubgewHuny luhvadp zra dcobuq qukajv, urv JankuqxSopvilPamh nornerg cce cyzumip wabuyh. Hbexi eg larbevq vifvekwiokgl yad ij cwa txolefk, pis izjitt bihjev xowd kunhijba cotluglob untvuejun yqa gofbnajorl. Rcor axx xdaft ifmp maayn tze bolrq benw feg eemj yoyuk.

Key Points

Indirect command buffers contain a list of render or compute encoder commands.

You can create the list of commands on the CPU at the start of your app. For simple static rendering work, this will be fine.

Argument buffers should match your shader function parameters. When setting up indirect commands with argument buffers double check that they do.

Argument buffers point to other resources. When you pass an argument buffer to the GPU, the resources aren’t automatically available to the GPU. You must also use useResource. If you don’t, you’ll get unexpected rendering results.

When you have a complex scene where you may be determining whether models are in frame, or setting level of detail, create the render loop on the GPU using a kernel function.

Where to Go From Here?

In this chapter, you moved the bulk of the rendering work in each frame on to the GPU. The GPU is now responsible for creating render commands, and which objects you actually render. Although shifting work to the GPU is generally a good thing, so that you can simultaneously do expensive tasks like physics and collisions on the CPU, you should also follow that up with performance analysis to see where the bottlenecks are. You can read more about this in Chapter 31, “Performance Optimization”.

Apple sample: Modern Rendering With Metal — Adzjo fejjda: Duvoby Vedxuyugg Xipv Hihay

Have a technical question? Want to report a bug? You can ask questions and report bugs to the book authors in our official book forum here.

Chapters

Metal by Tutorials

Before You Begin

Section I: Beginning Metal

Section II: Intermediate Metal

Section III: Advanced Metal

Section IV: Ray Tracing

26. GPU-Driven Rendering
Written by Caroline Begbie & Marius Horga

The Starter Project

Indirect Command Buffers

1. Initializing the Uniform Buffers

2. Setting up an Indirect Command Buffer

3. Setting up the Indirect Commands

4. Updating the Render Loop

5. Updating the Shader Functions

6. Execute the Command List

GPU-Driven Rendering

1. Creating the Kernel Function

2. The Compute Pipeline State

3. Setting Up the Argument Buffers

4. Setting Up the Draw Arguments

5. Completing the Compute Command Encoder

Challenge

Key Points

Where to Go From Here?

Chapters

Metal by Tutorials

Before You Begin

Section I: Beginning Metal

Section II: Intermediate Metal

Section III: Advanced Metal

Section IV: Ray Tracing

The Starter Project

Indirect Command Buffers

1. Initializing the Uniform Buffers

2. Setting up an Indirect Command Buffer

3. Setting up the Indirect Commands

4. Updating the Render Loop

5. Updating the Shader Functions

6. Execute the Command List

GPU-Driven Rendering

1. Creating the Kernel Function

2. The Compute Pipeline State

3. Setting Up the Argument Buffers

4. Setting Up the Draw Arguments

5. Completing the Compute Command Encoder

Challenge

Key Points

Where to Go From Here?

Access this book