Data
Start with the terms and conditions for any two brands that offer both digital and in-store purchases and use technologies that may collect visual likeness or biometric data through virtual try-on tools. Ulta Terms and Conditions Sephora Beauty Insider
Research
There's a lot of research related to both document comparison and analyzing policy and legal documents. Here's what they've found helps: Provide context. Describe the documents provided. For instance "Below are the terms and conditions for two beauty brands." [1][2] Define categories. When extracting information into categories, provide a full sentence definition. This performed significantly better than simply naming the data types to look for. Placement matters. Placing the text at the beginning may slightly increase accuracy and recall of the document. [1] Prompt chaining. Break up your prompt into multiple steps, each taking on a role relevant to it's instruction. [2] But only break up when the complexity is such that the AI assistant isn't giving you good enough results. Studies also show that extraction and analysis may work better in a single step in some cases, perhaps because the LLM still has full context of the document. [1] Term Parser - for extracting information Term Verifier - for validating extracted information is in the source document Analyst - for comparing terms found between documents and lastly making sense of the difference Provide examples. It takes more time, but if you give the AI assistant sample input and output it will use that to more accurately analyze your documents. For policy analysis two examples may provide the best results. This technique is called "few-shot prompting". [1] Sources The full papers are available online. Rodriguez, D., Yang, I., Del Alamo, J.M. et al. Large language models: a new approach for privacy policy analysis at scale. Computing 106, 3879–3903 (2024). https://doi.org/10.1007/s00607-024-01331-9 Mridul, M.A., Kang, I., Seneviratne, O. Terminators: Terms of service parsing and auditing agents. arXiv preprint arXiv:2505.11672 (2025). https://doi.org/10.48550/arXiv.2505.11672
Guiding Questions
The objective is to use AI to compare documents given our experiment guidelines. Use these guiding questions to focus your prompting: Does the policy mention biometric data or virtual likeness? How is that data collected, stored, or shared? What rights are you granting the company over your data or image? Which parts of the policy seem unusual or differ from other companies? Is it clear how to opt out or remove my data?
To get the most value from our session, make sure you've completed this checklist:
Selected two documents
Tried 2–3 prompts
Answered the guiding questions
Prepared to share one insight or challenge
Registered for the session