negotiation with self-play and in-context
learning from AI feedback [9].
In summary, the use of agentic workflows not
only improves the fidelity of AI outputs but also
contributes to the broader goal of making
advanced AI capabilities more accessible and
affordable. These insights are crucial for
complex workflows that require agentic
architecture, where models interact through
multiple revisions, additions, and objections to
produce a complete work product—whether in
content generation, production efficiency, or
decision-making—using smaller yet highly
capable models.
References
1. Yao, J-Y., Ning, K-P., Liu, Z-H., Ning, M-N.,
Liu, Y-Y. and Yuan, L. (2024) 'LLM lies:
Hallucinations are not bugs, but features
as adversarial examples', arXiv. Available
at: https://arxiv.org/abs/2310.01469
2. Rawte, V., Chakraborty, S., Pathak, A.,
Sarkar, A., Tonmoy, S.M.T.I., Chadha, A.,
Sheth, A.P. and Das, A. (2023) 'The
troubling emergence of hallucination in
large language models – An extensive
definition, quantification, and prescriptive
remediations', arXiv. Available at:
https://arxiv.org/abs/2310.04988
3. Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D.,
Xu, Y., Ishii, E., Bang, Y.J., Madotto, A.
and Fung, P. (2023) 'Survey of
Hallucination in Natural Language
Generation', ACM Computing Surveys,
55(12), article 248, pp. 1-38. Available at:
https://doi.org/10.1145/3571730.
4. Tonmoy, S.M.T.I., Zaman, S.M.M., Jain, V.,
Rani, A., Rawte, V., Chadha, A. and Das,
A. (2024) 'A comprehensive survey of
hallucination mitigation techniques in large
language models', arXiv. Available at:
https://arxiv.org/abs/2401.01313
5. Ji, Z., Yu, T., Xu, Y., Lee, N., Ishii, E. and
Fung, P. (2023) 'Towards mitigating LLM
hallucination via self reflection', in
Bouamor, H., Pino, J. and Bali, K. (eds.)
Findings of the Association for
Computational Linguistics: EMNLP 2023,
Singapore: Association for Computational
Linguistics, pp. 1827–1843. Available at:
https://aclanthology.org/2023.findings-emnl
p.123 (Accessed: 18 October 2024).
6. Verspoor, K. (2024) 'Fighting fire with fire
— using LLMs to combat LLM
hallucinations', Nature, 630, pp. 569-570.
Available at:
https://doi.org/10.1038/d41586-024-01641-
0
7. Zhang, B., Mao, H., Ruan, J., Wen, Y., Li,
Y., Zhang, S., Xu, Z., Li, D., Li, Z., Zhao,
R., Li, L. and Fan, G. (2024) 'Controlling
large language model-based agents for
large-scale decision-making: An
actor-critic approach', arXiv. Available at:
https://arxiv.org/abs/2311.13884
8. Dibia, V., Chen, J., Bansal, G., Syed, S.,
Fourney, A., Zhu, E., Wang, C., & Amershi,
S. (2024). AutoGen Studio: A No-Code
Developer Tool for Building and Debugging
Multi-Agent Systems. arXiv preprint
https://arxiv.org/abs/2408.15247
9. Fu, Y., Peng, H., Khot, T. and Lapata, M.
(2023) 'Improving language model
negotiation with self-play and in-context
learning from AI feedback', arXiv. Available
at: https://arxiv.org/abs/2305.10142