Draft:AI code agent

From Wikipedia, the free encyclopedia

Lead

This draft creates a new article about AI code agents. When the automatic programming wiki page mentioned AI code agents, it linked the phrase to the page on vibe coding, which is not the same concept: vibe coding is a newly coined term for the coding style that arises when using AI code agents, and is therefore not accurate in that context. This gap between existing pages motivates a dedicated page for AI code agents.

AI code agents

AI code agents are AI (artificial intelligence) systems that can autonomously perform programming tasks, such as generating source code, debugging, or analyzing code. They serve as automated "coders" that act on high-level natural language instructions in the coding workflow. The concept of automating code generation has its roots in conventional computer science research (often termed automatic programming or program synthesis),[1] but modern AI code agents became prominent in the 2020s due to advances in LLMs (large language models).[2][3]

Definition and historical development

Theoretically, the underlying idea of an AI system that writes code on its own falls under the concepts of program synthesis or automatic programming. In 1957, Alonzo Church[4] presented a seminal paper titled "Application of Recursive Arithmetic to the Problem of Circuit Synthesis" at the Summer Institute of Symbolic Logic at Cornell University. While his focus was on synthesizing digital circuits from mathematical specifications, the work laid the foundational ideas for program synthesis. Program synthesis is traditionally defined as constructing a program that meets a given specification, relieving humans of the need to write correct code by hand. Early approaches in the 1960s–70s used formal logic and theorem proving to derive programs. Cordell Green (1969)[5] introduced one of the first program synthesizers by applying theorem proving to problem solving, demonstrating that formal logic could be used to derive programs from formal specifications. Zohar Manna and Richard Waldinger (1970s–1980s)[6] developed an approach that derives programs directly from their specifications using logical inference rules. Alongside these theoretical frameworks, practical tools and systems emerged: the Cornell Program Synthesizer (1978),[7] developed by Tim Teitelbaum and Thomas Reps, was one of the first integrated development environments (IDEs) to incorporate program synthesis ideas, giving developers syntax-directed editing feedback during program development.

In the early 21st century, Armando Solar-Lezama[8] introduced the idea of synthesizing code by filling in "holes" in a partial program using constraint solvers, and in the mid-2000s developed Sketch, a notable milestone. By encoding a synthesis problem as a SAT/SMT problem,[9][10] researchers could leverage powerful solvers to find programs that meet a specification. Microsoft's FlashFill (released in Excel 2013)[11] was the first widely used commercial program synthesis tool, enabling end users to automate text-processing tasks without writing code. In the late 2010s, the convergence of "Big Code" (large collections of open-source code) and deep learning set the stage for modern code agents.[12] In 2015–2016, researchers began training neural networks on these large code corpora.[13] For instance, DeepCoder (2017)[14] demonstrated that a neural network could learn to compose simple programs (in a domain-specific language) from examples. A similar example was Bayou (2018),[15] which used a neural language model to generate API-centric Java code from a few hints. These projects were limited in scope, but they showed that deep learning is a viable path toward automatic programming.

In today's technology industry, an "AI code agent" refers more broadly to an AI system pre-trained on large code corpora using machine learning models, with access to conventional software engineering and program synthesis tools. Such agents are a domain-specific application of the more general "AI agent" concept. IBM[16] defines an AI agent as a system capable of autonomously performing tasks (planning actions and invoking tools) toward goals set by a user; an AI code agent is such a system specialized in software development tasks. The term came into wider use following the emergence of practical coding assistants in the 2020s: GitHub's Copilot (2021) was described as an "AI pair programmer",[17] and OpenAI's 2025 release of Codex was explicitly introduced as a "cloud-based software engineering agent".[18] Thus, while the phrase "AI code agent" has no single inventor, it represents the convergence of the AI agent concept with longstanding efforts to automate programming.

Techniques and methods

AI code agents are built on a combination of techniques from programming languages, formal methods, and machine learning. Program synthesis[19] techniques, derived from programming-language and formal-methods research, were central to the early development of automatic programming. They come in two broad flavors:

  • Deductive program synthesis: These methods construct code from formal specifications using logical deduction. Early approaches viewed code generation as a byproduct of proving theorems: if one can prove that an output satisfying certain conditions exists, a program can be extracted from that proof. Classic deductive systems, like those developed by Manna and Waldinger, generated programs through symbolic reasoning and correctness proofs. This approach guarantees correctness but often struggles with the complexity of real-world specifications.[20]
  • Inductive program synthesis: These techniques infer programs from examples or informal specifications. An important subcategory is programming by examples (PBE), where the system is given example input-output pairs and must generate code consistent with them. An early success in this area was Microsoft's FlashFill (2013) for Excel, which, given a few examples of string transformations, could synthesize a program to perform the task for all rows. Inductive methods often use search algorithms or heuristics to explore the space of possible programs.[21] Evolutionary algorithms (genetic programming) were also explored in the 1990s as a way to evolve programs to fit example data, with some success on small-scale problems.[22]
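The inductive, programming-by-examples approach can be illustrated with a minimal, self-contained sketch: enumerate compositions of primitive operations until one is consistent with every example pair. The primitive set and examples below are invented for illustration and are not taken from FlashFill or any real tool.

```python
# Minimal programming-by-examples (PBE) sketch: brute-force search over
# pipelines of string primitives, keeping the first pipeline that is
# consistent with all given input-output examples.
from itertools import product

PRIMITIVES = {
    "upper": str.upper,
    "lower": str.lower,
    "strip": str.strip,
    "first_word": lambda s: s.split()[0] if s.split() else "",
}

def synthesize(examples, max_depth=3):
    """Search for a pipeline of primitives matching all example pairs."""
    for depth in range(1, max_depth + 1):
        for names in product(PRIMITIVES, repeat=depth):
            def run(s, ns=names):
                for n in ns:
                    s = PRIMITIVES[n](s)
                return s
            # A candidate survives only if it reproduces every example.
            if all(run(inp) == out for inp, out in examples):
                return list(names)
    return None  # no pipeline within the depth bound fits the examples

examples = [("  Hello World ", "HELLO"), ("  Foo Bar", "FOO")]
print(synthesize(examples))  # e.g. ['upper', 'first_word']
```

Real PBE systems replace this brute-force enumeration with version-space algebras or ranked search, but the core loop (propose a program, check it against the examples) is the same.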

Machine learning and neural networks have become increasingly important in AI code agents, especially in the last decade. Instead of manually encoding search strategies, modern systems train models on large corpora of code. Neural sequence-to-sequence models and Transformers treat code as a form of language to be learned. A milestone was the introduction of models like DeepCoder (Balog et al., 2017),[14] which learned to predict useful code components from input-output examples and guided a search to assemble programs. By the late 2010s, large-scale language models pre-trained on source code became feasible. These large language models (LLMs) for code (often based on the Transformer architecture) are now the dominant method for AI coding assistants. OpenAI's Codex (2021)[23] demonstrated that an LLM fine-tuned on billions of lines of code could translate natural language prompts into code with remarkable competence, solving around 70% of the problems in a standard benchmark when allowed to sample many candidate solutions. Such models, including Codex and its successors, underlie tools like Copilot. They work by probabilistically predicting code that is likely to satisfy the intent described in the prompt or the context in the editor.
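The idea of treating code as a language to be predicted token by token can be shown at toy scale with a bigram model. The tiny corpus and whitespace tokenizer below are deliberately simplistic stand-ins for a Transformer trained on billions of lines; only the underlying principle (predict the most probable next token given context) carries over.

```python
# Toy "code as language" model: count token bigrams in a small corpus,
# then predict the most likely next token for a given current token.
from collections import Counter, defaultdict

corpus = [
    "for i in range ( n ) :",
    "for j in range ( m ) :",
    "if x in values :",
]

# counts[prev][next] = how often `next` followed `prev` in the corpus
counts = defaultdict(Counter)
for line in corpus:
    tokens = line.split()
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1

def predict_next(token):
    """Return the most frequent follower of `token`, or None if unseen."""
    followers = counts.get(token)
    return followers.most_common(1)[0][0] if followers else None

print(predict_next("in"))  # "range" follows "in" twice, "values" once
```

An LLM generalizes this in two ways: it conditions on a long window of preceding tokens rather than one, and it learns a smooth distribution from data instead of raw counts, which is what lets it complete code it has never seen verbatim.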

To enhance the capabilities of code-generation models, developers also integrate other AI techniques. Reinforcement learning (RL) is used to further train code agents for specific goals – for example, OpenAI's Codex agent (2025) was tuned with reinforcement learning on coding tasks to better adhere to instructions and to iteratively run generated code against tests until a correct solution is found. DeepMind's AlphaCode[24] employed a massive generate-and-filter strategy: it would generate a multitude of candidate programs for a given problem using a Transformer-based model, then execute those candidates and filter them based on their runtime results (tests) to pick correct solutions. In a related vein, DeepMind's later project AlphaDev (2023)[25] used reinforcement learning[26] to discover new efficient algorithms in assembly code, treating algorithm discovery as a game and finding sorting routines faster than known human benchmarks. Additionally, AI code agents often incorporate static analysis or symbolic reasoning as tools: for instance, an agent might internally call a type-checker or symbolic executor to validate or refine its generated code.[27][28] Modern systems are therefore hybrids – they leverage the learned knowledge and pattern recognition of ML models, the rigorous checking of formal methods, and sometimes an iterative loop (propose code, test it, fix errors) that mimics how a human might debug. Combining these techniques allows state-of-the-art code agents to tackle complex programming tasks that were once far beyond the reach of automated systems.
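The generate-and-filter strategy can be sketched in a few lines: sample many candidate programs, execute each against the problem's tests, and keep only those that pass. The candidate generator below is a hypothetical stand-in for sampling from a learned model, and the task (square a number) is invented for illustration.

```python
# Generate-and-filter sketch: execute each candidate program against the
# problem's test cases and keep only the candidates that pass all of them.

def candidate_generator():
    """Stand-in for model sampling: yields candidate program sources."""
    yield "def solve(x):\n    return x * 2"   # plausible but wrong
    yield "def solve(x):\n    return x ** 2"  # consistent with the tests
    yield "def solve(x):\n    return x + x"   # plausible but wrong

tests = [(2, 4), (3, 9), (4, 16)]  # (input, expected output) pairs

def filter_candidates(candidates, tests):
    passing = []
    for src in candidates:
        namespace = {}
        try:
            exec(src, namespace)  # compile and load the candidate
            if all(namespace["solve"](i) == o for i, o in tests):
                passing.append(src)
        except Exception:
            pass  # discard candidates that crash or fail to compile
    return passing

survivors = filter_candidates(candidate_generator(), tests)
print(len(survivors))  # only the x ** 2 candidate passes
```

Production systems add sandboxing, time limits, and clustering of behaviorally equivalent candidates, but the core filter (run the code, trust only what passes) is the same.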

Critiques and limitations

Despite these advances, AI code agents face a number of challenges and criticisms. A serious concern is accuracy and reliability. While modern code generators output seemingly correct code snippets, they do not truly "understand" code semantics and can mislead developers by producing incorrect code with high confidence. AI agents have a tendency to "hallucinate": producing code that looks plausible but is logically flawed or references nonexistent libraries or functions. An empirical evaluation of GitHub Copilot,[29] for example, found that the correctness of its suggestions varied widely by task and language. For certain tasks, Copilot's suggestions had correctness rates as low as 27% (for JavaScript), meaning the majority of its outputs did not work without modification. This unreliability means developers must review and test AI-written code carefully, which runs counter to the goal of automatic programming.

Moreover, because they are trained on past code, they may over-represent older frameworks or patterns and under-suggest newer, potentially better ones.[30]

AI code agents also raise security and legal concerns. Studies have shown that naive use of these tools can introduce security vulnerabilities. A 2021 research paper from NYU's Center for Cybersecurity found that about 40% of the code Copilot produced in security-relevant scenarios contained potential security flaws (such as use of insecure practices).[31]

Legally, AI agents trained on open-source code have stirred controversy over intellectual property. In late 2022, a group of programmers filed a class-action lawsuit alleging that tools like Copilot violate open-source licenses by regurgitating sections of licensed code without proper attribution, characterizing Copilot's operation as "software piracy on an unprecedented scale" when it outputs code identical or similar to licensed code from its training set. GitHub and OpenAI have contested this, and the legal questions remain unresolved. In response, some AI coding tools now provide citations, or at least an indicator, when a suggestion closely matches a known code repository, and there is ongoing work on letting users exclude their code from training data.[32][33]

Integration and practicality issues also limit AI code agents. Many of these tools run on large cloud-hosted models, which means using them might require sending proprietary code to an external service – a privacy and confidentiality concern for companies. Indeed, some organizations have banned internal use of tools like ChatGPT or Copilot after incidents where sensitive code was inadvertently leaked. For instance, Samsung temporarily banned generative AI usage in 2023 after an engineer pasted confidential source code into ChatGPT, which posed an IP risk. This highlights the need for on-premises or private deployments of AI coding models for certain users.[34]

Additionally, the current generation of code agents can struggle with context limitations. They have a fixed context window (often a few thousand tokens), so on large projects they may not see all relevant parts of the codebase, leading to inconsistent suggestions. They also lack a true understanding of a project's architecture or the intent behind code, so without detailed guidance from the developer they may make suggestions that do not fit the overall design.[35]
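A common mitigation for the fixed context window is retrieval: scoring project files for relevance to the task and packing only the best matches into the token budget. The sketch below uses a crude keyword-overlap score and whitespace token counts as illustrative assumptions; real tools use embeddings, proper tokenizers, and much larger budgets.

```python
# Context-packing sketch: rank files by keyword overlap with the query,
# then greedily add files until the (toy) token budget is exhausted.

def pack_context(query, files, budget_tokens):
    """Greedily select files most relevant to `query` within the budget."""
    q_words = set(query.lower().split())
    # Sort files by how many query words appear in their text.
    scored = sorted(
        files.items(),
        key=lambda kv: len(q_words & set(kv[1].lower().split())),
        reverse=True,
    )
    selected, used = [], 0
    for name, text in scored:
        cost = len(text.split())  # crude stand-in for a token count
        if used + cost <= budget_tokens:
            selected.append(name)
            used += cost
    return selected

files = {
    "auth.py": "def login user password session token",
    "db.py": "def connect database cursor query",
    "ui.py": "def render button window layout",
}
print(pack_context("fix the login session bug", files, budget_tokens=8))
# → ['auth.py']
```

The design trade-off is visible even at this scale: a tight budget forces the agent to drop files entirely, so anything the scoring function misjudges becomes invisible to the model, which is one source of the inconsistent suggestions described above.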

Another limitation is that many code agents do not actively run or test the code they generate (unless explicitly integrated with a runtime), so they might produce syntactically correct but logically incorrect solutions. Most IDE assistants will only provide one suggestion at a time.[36]

References

  1. ^ Cardelli, Luca; Wegner, Peter (1985-12-10). "On understanding types, data abstraction, and polymorphism". ACM Comput. Surv. 17 (4): 471–523. doi:10.1145/6041.6042. ISSN 0360-0300.
  2. ^ Austin, Jacob; Odena, Augustus; Nye, Maxwell; Bosma, Maarten; Michalewski, Henryk; Dohan, David; Jiang, Ellen; Cai, Carrie; Terry, Michael (2021-08-16), Program Synthesis with Large Language Models, arXiv, doi:10.48550/arXiv.2108.07732, arXiv:2108.07732, retrieved 2025-05-31
  3. ^ Lyu, Michael R.; Ray, Baishakhi; Roychoudhury, Abhik; Tan, Shin Hwei; Thongtanunam, Patanamon (2024-05-15), Automatic Programming: Large Language Models and Beyond, arXiv, doi:10.48550/arXiv.2405.02213, arXiv:2405.02213, retrieved 2025-05-31
  4. ^ Friedman, Joyce (December 1963). "Alonzo Church. Application of recursive arithmetic to the problem of circuit synthesis. Summaries of talks presented at the Summer Institute for Symbolic Logic, Cornell University, 1957, 2nd edn., Communications Research Division, Institute for Defense Analyses, Princeton, N. J., 1960, pp. 3–50, 3a–45a". Journal of Symbolic Logic. 28 (4): 289–290. doi:10.2307/2271310. ISSN 0022-4812.
  5. ^ Pirotte, Alain (January 1973). "Automatic theorem proving based on resolution". Annual Review in Automatic Programming. 7: 201–266. doi:10.1016/0066-4138(73)90001-3. ISSN 0066-4138.
  6. ^ Manna, Zohar; Waldinger, Richard (June 1975). "Knowledge and reasoning in program synthesis". Artificial Intelligence. 6 (2): 175–208. doi:10.1016/0004-3702(75)90008-9. ISSN 0004-3702.
  7. ^ Teitelbaum, Tim; Reps, Thomas (1981-09-01). "The Cornell program synthesizer: a syntax-directed programming environment". Commun. ACM. 24 (9): 563–573. doi:10.1145/358746.358755. ISSN 0001-0782.
  8. ^ Solar-Lezama, Armando (2009), "The Sketching Approach to Program Synthesis", Lecture Notes in Computer Science, Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 4–13, ISBN 978-3-642-10671-2, retrieved 2025-05-31
  9. ^ Monniaux, David (2016-06-15), A Survey of Satisfiability Modulo Theory, arXiv, doi:10.48550/arXiv.1606.04786, arXiv:1606.04786, retrieved 2025-05-31
  10. ^ Hozzová, Petra; Kovács, Laura; Norman, Chase; Voronkov, Andrei (2024-02-29), Program Synthesis in Saturation, arXiv, doi:10.48550/arXiv.2402.18962, arXiv:2402.18962, retrieved 2025-05-31
  11. ^ "Using Flash Fill in Excel - Microsoft Support". support.microsoft.com. Retrieved 2025-05-31.
  12. ^ Allamanis, Miltiadis; Barr, Earl T.; Devanbu, Premkumar; Sutton, Charles (2018-07-31). "A Survey of Machine Learning for Big Code and Naturalness". ACM Computing Surveys. 51 (4): 1–37. doi:10.1145/3212695. ISSN 0360-0300.
  13. ^ Allamanis, Miltiadis; Peng, Hao; Sutton, Charles (2016-05-25), A Convolutional Attention Network for Extreme Summarization of Source Code, arXiv, doi:10.48550/arXiv.1602.03001, arXiv:1602.03001, retrieved 2025-05-31
  14. ^ a b Balog, Matej; Gaunt, Alexander L.; Brockschmidt, Marc; Nowozin, Sebastian; Tarlow, Daniel (2017-03-08), DeepCoder: Learning to Write Programs, arXiv, doi:10.48550/arXiv.1611.01989, arXiv:1611.01989, retrieved 2025-05-31
  15. ^ Murali, Vijayaraghavan; Qi, Letao; Chaudhuri, Swarat; Jermaine, Chris (2018-04-12), Neural Sketch Learning for Conditional Program Generation, arXiv, doi:10.48550/arXiv.1703.05698, arXiv:1703.05698, retrieved 2025-05-31
  16. ^ Pinnaka, Manasvi; Zaidi, Sohail; Viswanathan, Vimal (2024-03-09). "Integrating Advanced IBM Cloud-Based AI/Machine Learning Platform to Develop Predictive Models for Medical Applications". 2024 IEEE Integrated STEM Education Conference (ISEC). IEEE: 1–2. doi:10.1109/isec61299.2024.10665146.
  17. ^ "Considering Responsible AI with GitHub Copilot". Programming with GitHub Copilot: 217–227. 2024-08-06. doi:10.1002/9781394319787.ch13.
  18. ^ "OpenAI Codex CLI". doi.org. 2025-05-15. Retrieved 2025-05-31.
  19. ^ Kitzelmann, Emanuel (2010), "Inductive Programming: A Survey of Program Synthesis Techniques", Lecture Notes in Computer Science, Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 50–73, ISBN 978-3-642-11930-9, retrieved 2025-05-31
  20. ^ Manna, Zohar; Waldinger, Richard (January 1980). "A Deductive Approach to Program Synthesis". ACM Transactions on Programming Languages and Systems. 2 (1): 90–121. doi:10.1145/357084.357090. ISSN 0164-0925.
  21. ^ Gulwani, Sumit (2011-01-26). "Automating string processing in spreadsheets using input-output examples". Proceedings of the 38th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages. New York, NY, USA: ACM. doi:10.1145/1926385.1926423.
  22. ^ Koza, John R. (June 1994). "Genetic programming as a means for programming computers by natural selection". Statistics and Computing. 4 (2). doi:10.1007/bf00175355. ISSN 0960-3174.
  23. ^ Chen, Mark; Tworek, Jerry; Jun, Heewoo; Yuan, Qiming; Pinto, Henrique Ponde de Oliveira; Kaplan, Jared; Edwards, Harri; Burda, Yuri; Joseph, Nicholas (2021-07-14), Evaluating Large Language Models Trained on Code, arXiv, doi:10.48550/arXiv.2107.03374, arXiv:2107.03374, retrieved 2025-05-31
  24. ^ Kolter, J. Zico (2022-12-09). "AlphaCode and "data-driven" programming". Science. 378 (6624): 1056–1056. doi:10.1126/science.add8258. ISSN 0036-8075.
  25. ^ Tufano, Michele; Agarwal, Anisha; Jang, Jinu; Moghaddam, Roshanak Zilouchian; Sundaresan, Neel (2024-03-13), AutoDev: Automated AI-Driven Development, arXiv, doi:10.48550/arXiv.2403.08299, arXiv:2403.08299, retrieved 2025-05-31
  26. ^ Mankowitz, Daniel J.; Michi, Andrea; Zhernov, Anton; Gelmi, Marco; Selvi, Marco; Paduraru, Cosmin; Leurent, Edouard; Iqbal, Shariq; Lespiau, Jean-Baptiste; Ahern, Alex; Köppe, Thomas; Millikin, Kevin; Gaffney, Stephen; Elster, Sophie; Broshear, Jackson (2023-06-07). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618 (7964): 257–263. doi:10.1038/s41586-023-06004-9. ISSN 0028-0836.
  27. ^ Li, Yihe; Meng, Ruijie; Duck, Gregory J. (2025-04-02), Large Language Model powered Symbolic Execution, arXiv, doi:10.48550/arXiv.2505.13452, arXiv:2505.13452, retrieved 2025-05-31
  28. ^ Wang, Wenhan; Liu, Kaibo; Chen, An Ran; Li, Ge; Jin, Zhi; Huang, Gang; Ma, Lei (2024-09-14), Python Symbolic Execution with LLM-powered Code Generation, arXiv, doi:10.48550/arXiv.2409.09271, arXiv:2409.09271, retrieved 2025-05-31
  29. ^ Nguyen, Nhan; Nadi, Sarah (2022-05-23). "An empirical evaluation of GitHub copilot's code suggestions". Proceedings of the 19th International Conference on Mining Software Repositories. New York, NY, USA: ACM. doi:10.1145/3524842.3528470.
  30. ^ Ciniselli, Matteo; Pascarella, Luca; Bavota, Gabriele (2022-04-14), To What Extent do Deep Learning-based Code Recommenders Generate Predictions by Cloning Code from the Training Set?, arXiv, doi:10.48550/arXiv.2204.06894, arXiv:2204.06894, retrieved 2025-05-31
  31. ^ Pearce, Hammond; Ahmad, Baleegh; Tan, Benjamin; Dolan-Gavitt, Brendan; Karri, Ramesh (May 2022). "Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions". 2022 IEEE Symposium on Security and Privacy (SP). IEEE: 754–768. doi:10.1109/sp46214.2022.9833571.
  32. ^ Xu, Weiwei; Gao, Kai; He, Hao; Zhou, Minghui (2025-02-25), LiCoEval: Evaluating LLMs on License Compliance in Code Generation, arXiv, doi:10.48550/arXiv.2408.02487, arXiv:2408.02487, retrieved 2025-05-31
  33. ^ Stalnaker, Trevor; Wintersgill, Nathan; Chaparro, Oscar; Heymann, Laura A.; Penta, Massimiliano Di; German, Daniel M.; Poshyvanyk, Denys (2025-03-19), Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development, arXiv, doi:10.48550/arXiv.2411.10877, arXiv:2411.10877, retrieved 2025-05-31
  34. ^ "Understanding Users' Security and Privacy Concerns and Attitudes Towards Conversational AI Platforms". arxiv.org. Retrieved 2025-05-31.
  35. ^ Zhang, Yusen; Sun, Ruoxi; Chen, Yanfei; Pfister, Tomas; Zhang, Rui; Arik, Sercan Ö (2024-06-04), Chain of Agents: Large Language Models Collaborating on Long-Context Tasks, arXiv, doi:10.48550/arXiv.2406.02818, arXiv:2406.02818, retrieved 2025-05-31
  36. ^ Esposito, Matteo; Li, Xiaozhou; Moreschini, Sergio; Ahmad, Noman; Cerny, Tomas; Vaidhyanathan, Karthik; Lenarduzzi, Valentina; Taibi, Davide (2025-03-17), Generative AI for Software Architecture. Applications, Trends, Challenges, and Future Directions, arXiv, doi:10.48550/arXiv.2503.13310, arXiv:2503.13310, retrieved 2025-05-31