Create an LLM policy

This guide will walk you through creating an LLM policy on Witboost

1. Introduction

The LLM endpoint configuration is needed to create a computational policy with LLM. A subscription with Azure OpenAI or Gemini is also needed because Witboost adopts a bring-your-own LLM approach. The first step is to go in the Governance Panel of Witboost in the policy registry

Introduction

2. Create a new Policy

We start the policy creation process clicking on create button

Create a new Policy

3. Policy Metadata

We start by inserting basic metadata for the new policy

Policy Metadata

4. Business Metadata Corretness

We are going to create a policy to assess the correctness of business metadata that is a crucial step to make the data intelligible and discoverable

Business Metadata Corretness

5. Description

Let's provide a meaningful description for the policy so people will understand why is important

Description

6. Select the resource type for this policy

In Witboost is possible to create different policies for different systems defined in the practice shaper. In this case we want to apply this policy to Data Products

Select the resource type for this policy

7. Select LLM

We want to create an LLM policy so we need to select this engine for the computation of the policy

Select LLM

8. Policy metadata completed

The other parameters in this case are already fine because we want to create a policy that will act at deploy time and it will be actively triggered by Witboost when a Data Product will be deployed or tested.

Policy metadata completed

9. Next Step

So now we are completed with the policy general metadata and we can proceed to the next stage.

Next Step

10. Define the prompt

The LLM policy needs to be instructed to perform its goal through a prompt

Define the prompt

11. Insert the Prompt

Let's type Detect any information put as placeholder or copied and pasted and not related to the general meaning of the data product. With this prompt we want to avoid people putting placeholders or repeating information just because they don't have time to properly fill the business metadata. Because this will impact deeply the discoverability and intelligibility of the data, we want to avoid it.

Insert the Prompt

12. Move on

Move into the next section

Move on

13. Save

Now let's save this policy and see how it works

Save

14. Custom Descriptor

The policy has been created, and it is now possible to test it. The best way to test an LLM policy is to leverage a custom data product descriptor. This way, we can easily iterate on testing new prompts and descriptors. It is recommended to test LLM policies extensively to check their deterministic capability.

Custom Descriptor

15. Open Descriptor editor

Open the Descriptor editor.

Open Descriptor editor

16. Paste your descriptor

In this editor is possible to paste and edit the descriptor

Paste your descriptor

17. Apply and Run

Now we paste the descriptor of a data product exposing cash flow information in a banking context. this data product has a couple of output ports with related schemas and business metadata. Now let's see what the policy is capable to detect

Apply and Run

18. Test Results

So in this first run it seems the policy is detecting problems in the descriptor we pasted, let's see what kind of problems

Test Results

19. Check errors

As you can see in this case the policy detected that information SLA field has been populated with a placeholder

Check errors

20. Return to policy

Let's return to the policy

Return to policy

21. Click "Descriptor editor"

Access the Descriptor editor.

Click 'Descriptor editor'

22. Fix the descriptor

Now we simulate how an user will react to this policy result: he will go to the descriptor and will fix it. In this case the policy correctly detected that a placeholder has been put into the Information SLA field.

Fix the descriptor

23. Let's fix it

We now remove the placeholder so we an further test the policy

Let's fix it

24. Test Again

Let's test again the policy

Test Again

25. Still errors

There is another error to check, now we jump in and we investigate

Still errors

26. Another placeholder

Also in this case seems that in the descriptor, specifically in the fully qualified name field we have another placeholder

Another placeholder

27. Click "Descriptor editor"

Access the Descriptor editor.

Click 'Descriptor editor'

28. Fix also this

Also in this case we see how the policy is helping us to clean up the descriptor. We now fi also this one to move forward.

Fix also this

29. Remove placeholder

So the policy seems to do its job

Remove placeholder

30. Test on Marketplace

So after this preliminary test with a custom descriptor, we can test this new policy all the data products already present in the marketplace

Test on Marketplace

31. Click "Run a Test"

Let's start the test

Click 'Run a Test'

32. Marketplace test results

The results are good: the policy to detect placeholders detected only one data product with some issue among sixteen

Marketplace test results

33. Policy Lifecycle

Now we can better understand how the policy life-cycle works

Policy Lifecycle

34. Grace

Grace is the first step to introduce a new policy, because limit the friction between platform team and data product teams. This policy will now become active but for a configurable grace period it will only raise warnings instead of blocking data product teams.

Grace

35. Grace On

So now the policy is officially active and in the grace period before becoming a mandatory quality gate.

Grace On

36. My Projects

Let's open our Data Products to test them and see if the policy is already visible to them.

My Projects

37. Data Product Test

Now I pickup again the Cash Flow Data Product that I now has some placeholder. Pay attention that before during the policy testing we didn't modify the real descriptor but just a volatile copy.

Data Product Test

38. Test Results

As we can see the policy is immediately available and enabled. As expected is failing for this Data Product, but interestingly it is just raising a Warning because the policy is in Grace Period

Test Results

39. Publish

Now we go back to the policy and we try to make it active without grace period

Publish

40. Click "Publish"

After this step it will not be possible to modify the policy, but it will be necessary to create a new version.

Click 'Publish'

41. MyProjects

Let's try to test again the Data Product

MyProjects

42. Run Again

Let's run the test again

Run Again

43. KO

Now that the policy is published, it is raising an error that will not allow to deploy this data product in production until the error will be fixed.

44. Disable

The platform team has also the extraordinary power to disable a policy in case of need ( Ex. an urgent deployment ). This also has the goal to minimize the friction between the platform team and the data product ones.

Disable

45. New version

At certain point there could be the need to evolve the policy, so is possible to create a new version.

New version

The guide covered the detailed process of creating, testing, and evolving an LLM policy on Witboost.

1. Introduction​

2. Create a new Policy​

3. Policy Metadata​

4. Business Metadata Corretness​

5. Description​

6. Select the resource type for this policy​

7. Select LLM​

8. Policy metadata completed​

9. Next Step​

10. Define the prompt​

11. Insert the Prompt​

12. Move on​

13. Save​

14. Custom Descriptor​

15. Open Descriptor editor​

16. Paste your descriptor​

17. Apply and Run​

18. Test Results​

19. Check errors​

20. Return to policy​

21. Click "Descriptor editor"​

22. Fix the descriptor​

23. Let's fix it​

24. Test Again​

25. Still errors​

26. Another placeholder​

27. Click "Descriptor editor"​

28. Fix also this​

29. Remove placeholder​

30. Test on Marketplace​

31. Click "Run a Test"​

32. Marketplace test results​

33. Policy Lifecycle​

34. Grace​

35. Grace On​

36. My Projects​

37. Data Product Test​

38. Test Results​

39. Publish​

40. Click "Publish"​

41. MyProjects​

42. Run Again​

43. KO​

44. Disable​

45. New version​