Scalable Real-Time Negotiation Toolkit Organizational-Structured Distributed Resource Allocation

Scalable Real-Time Negotiation Toolkit Organizational-Structured, Distributed Resource Allocation PI: Victor R. Lesser University of Massachusetts 1

Problem Description/Objective Organizational-Structured Distributed Resource Allocation • The specific technical problems we are trying to solve: – Development of Soft Real-Time, Distributed Resource Allocation Protocols – Development of Techniques for Specification, Implementation and Adaptation of Agent Organizations • Relevance to Do. D: – Techniques for building large-scale, soft real-time, multi-agent applications involving complex resource allocation decisions • Distributed sensor networks, distributed command control 3

Approach to Soft, Real-Time Distributed Coordination/Resource Allocation • Structured as a distributed optimization problem with a range of “satisficing” solutions • Adaptable to available time and communication bandwidth • Responsive to dynamics of environment • Organizationally-constrained — range of agents and issues are limited • Can be done at different levels of abstraction • Does not require all resource conflicts to be resolved to be successful — resource manager agents able to resolve some issues locally 5

Multi-level Approach to Distributed Resource Allocation and Coordination • Organizational Design: Determine appropriate agent roles and responsibilities • Team-Based Negotiation: Managers negotiate solutions to allocation conflicts • Local Autonomy: Individuals decide local unresolved allocation details 6

Capabilities • Scaling: Support for large-scale adaptive agent sensor networks • Efficiency: Organizationally grounded resource allocation • Responsiveness: Dynamic, soft real-time resource allocation • Adaptability: Organizational self-design and maintenance 7

Major Issues in Implementing this Approach • What is an appropriate organization for agents – Scalability and Robustness • What is the protocol for distributed resource allocation – Soft Real-Time, Graceful Degradation, Efficient • What is the structure of an agent architecture that supports: – agents functioning in an organizational context – agents implementing complex distributed resource protocols – agents operating under soft real-time constraints How domain-independent and efficient can we make these approaches? 8

Our Solution at the Organizational Level • Decompose environment to form a partitioned organization. – Each partition (sector) will contain a set of sensor nodes, each with its own controlling agent. – Individual sectors are relatively autonomous. • Specialize members of the agent population to dynamically take on multiple, different goals/roles. – Individual agents become “managers” of different aspects of the problem. – Managers form high-level plans to address their goals, and negotiate with other nodes to achieve them. 9

Sectored-Based Agent Organization Agents multiplex among different roles Sector Manager Tracking Manager Scanning Agent Tracking Agent 10

Organizationally-Structured Communication among Agents Dr. A Dr. Q Dr. R TB RR TD PT C RB PC DA TB U ES Sector Manager Tracking Manager Scanning Agent Tracking Agent

Managing Conflicted Resources: Sensors, Processors, Communication • Sensors – Conflicting scanning tasks from different sector managers • Locally resolved by agent connected to sensor — SRTA agent – Tracking tasks wanting same sensor resources • Negotiation among track managers — SPAM protocol • Communication – Communication degradation due to lack of locality • Track manager migration among sectors – Communication channel overload • Sector manager assignment of track manager roles • Processors – Data fusion overload/knowledge locality • Sector manager assignment of data fusion/track manager roles – Multiplexing Roles • SRTA local agent control/scheduling 12

SRTA: Soft Real-Time Agent Architecture • Facilitates creation of multi-resource management agents • Basis of building complex “virtual” agent organizations • Allows for abstract negotiation — maps abstract assignment into detailed resource allocations • Ability to resolve conflicts locally that are not resolved through negotiation These are key to building soft real-time distributed allocation policies 13

Soft Real-Time Control Architecture Negotiation Commitments/ Decommitments Other Agents (e. g. SPAM) Schedule Failure Problem solver Periodic Task Controller Goal Description/Objective Schedule failure/ Abstract view Update Expectations TÆMS Library TAEMS-Plan Network/Objective Learning Update Cache Resource Modeler Resource Uses Conflict Resolution Module Schedule Multiple Structures Task Merging Cache Check Cache Hit DTC-Planner Linear Plan Partial Order Scheduler Parallel Schedule Failure Parallel Execution Module Results 14

SPAM: Resource Adaptive Family of Anytime Negotiation Strategies • Low bandwidth or not a lot of time: – Single-shot — single assignment message to the sensor agent based on uncertain/incomplete information • Relaxing of objectives based on local information • High bandwidth, a lot of time: – Multi-step negotiation with track managers and sensors 15

Mediation-Based Negotiation World View. Multi-Linking of Resource Allocations Interdependency Graph S 12, S 22 Mediator View 1 M 14 S 32 M 25 1 2 M 20 S 15 M 33 S 25, S 20 S 7 S 2, S 14 M 0 S 5 M 20 S 18 M 7 S 53 M 8 S 8 M 33 1 1 1 M 7 1 M 8 1 16

Stage 2 - Track Manager to Track Manager Negotiation • Originating track manager acts as mediator – Generates solution space – Recommends solution quality reductions – Chooses final solution • Negotiation Mediator gets partial non-local information – Some/All of the sensor schedules relevant to specific track • Used to find neighbors (other track managers) in the constraint graph – Conflicting track managers’ information • Domain of acceptable assignments – Current solution quality – Number of possible sensors that can be used for tracking – Sensors that are in conflict (mediator to neighbor and neighbor to neighbor) • Additional constraints – fuzzy notion of constraints on non-directly conflicted sensors 17

Major accomplishments/contributions of the project • Development of SPAM heuristic resource allocation protocol – Showed importance of mediation-based negotiation (partial centralization) with overlapping context and extended views along critical paths for search and communication efficiency • Development of APO distributed constraint algorithm based on SPAM concepts – Better performance than best known algorithm - AWC • Development of SRTA soft real-time architecture – Demonstrated that a sophisticated domain-independent agent architecture that operates in soft real-time could be built • Demonstrated importance of organizational structuring for distributed resource allocation – Showed how using negotiation organization could be dynamically constructed and efficiently modified as the environment changed 18

Recent Accomplishments • New Results on SPAM, APO and Opt APO • First Results on Organizational. Structured Coalition Formation • Performance Improvements in FARM • Organization Design Framework 19

SPAM’s Effectiveness • • • 20 randomly placed sensors. Between 2 and 9 randomly placed fixed targets. 160 test runs (20 runs for each number of targets) Ran until SPAM converged. Optimal – utility was computed using a Branch and Bound search where the domain for each track was the possible objective levels. – tracks were computed by using a Branch and Bound search where the domain for each track was either the minimal utility for tracking or nothing. • Greedy – Each track manager requests 4 of the available sensors at random for every time slot. – Commitments override each other in the sensor schedules. 20

Utility Comparison as a % of Optimal • SPAM stays closer to the optimal value and has less variance in its utility. 21

Tracking Comparison • SPAM tracks nearly 100% of the optimal number of targets that can be tracked. • Greedy ignores more targets as contention increases. 22

Time to Convergence The time to converge increases linearly with an increase in contention. 23

SPAM’s Scalability • 100 -800 agents. • Each agent was either a sensor or track manager. • Fixed ratio of sensors to targets of 2. 5 sensors per target. – Fairly overconstrained • Sensors are randomly placed. • Targets move with a random velocity that is uniformly distributed from 0. 0 to 1. 0 ft/s. • Environment size had a fixed expected sensor density of 4 sensors per point. • Twenty 3 -minute runs per data point. 24

Utility Scalability SPAM consistently maintains a higher utility than a greedy assignment 25

Tracking Scalability SPAM also tracks a higher percentage of the targets that are viewable by 3 or more sensors. 26

Communication Scalability There is no apparent increase in the communications per agent as the number of agents increase. 27

Asynchronous Partial Overlay (APO) • A new algorithm for Distributed Constraint Satisfaction (DCSP) • Three basic principles – Mediation-based – Overlapping views and exploiting local context – Extending views along critical paths • Proven to be both complete and sound 28

How it Works • Agents take the role of mediator when they have conflict • Mediator gathers information from other agents in the session concerning value preferences and their effects • Mediator chooses a solution that removes local constraint violations and minimizes the effect outside of its view • Mediator then links with agents for which it caused violations (expanding context along critical paths) 29

Testing APO • Implemented the graph coloring domain in the Farm simulator – 3 Coloring problems – nodes = 15, 30, 45, 60, 75, 90 – edges • 2. 0 X nodes (Low, left of phase transition) • 2. 3 X nodes (Medium, in phase transition) • 2. 7 X nodes (High, right of phase transition) • Compared APO against the Asynchronous Weak. Commitment (AWC) protocol (Yokoo ‘ 95) – 10 random, solvable problems, each with 10 different starting assignments (Minton et al. ‘ 92) • AWC is currently the best known method for solving DCSPs 30

Results – Low Density Graphs 31

Results – Medium Density Graphs 32

Results – High Density Graphs 33

SPAM II — Optimal APO • Currently working on an optimization version of APO. • Based on the three main APO principles. – Mediation-based – Overlapping views and exploiting local context – Extending views along critical paths 34

How it works • Each agent computes the optimal value for their local subproblem (upper bound) and the current value based on their current view. • Mediation occurs when: – upper bound is greater than its current value – one of the links they own has a value that is not the highest available • Mediator gathers information from other agents in the session concerning value preferences and their effects • Mediator chooses a solution that is optimal and minimizes the impact outside of its view • Mediator then links with agents that had their current value lowered as a result of the mediation (expanding context along critical paths) 35

Testing Optimal APO • Preliminary testing on partial constraint satisfaction in 3 -coloring – Optimal APO appears to be sound and optimal – It may be better than other DCOP techniques • More testing is needed to confirm these suspicions • Optimality and soundness proofs are underway 36

Organizationally-Structured Distributed Coalition Formal Definition of the Task Allocation Problem: • Let R = {R 1, …, Rk} be the set of resources. • Let A = {a 1, … , an} be the set of agents, where each agent ai controls a set of resources CRi = {cri, 1, …, cri, k} • Let T = {T 1, …, Tm} be the set of tasks to be undertaken, where each task Tj has a utility, a set of required resources RRj = {rrj, 1, …, rrj, k}, an arrival time, a duration, and possibly a deadline. • The goal is to maximize the total utility of accomplished tasks. • A task Tj is accomplished if it was assigned to a coalition Cj of agents that collectively has enough resources to accomplish Tj while satisfying its timing constraints. How to construct for a large number of agents an organization of agents and associated allocation policy that optimizes this allocation process over an ensemble of tasks 37

An Organization for Distributed Coalition Brokering a 0 a 1 a 6 a 5 a 0 a 2 a 3 a 1 a 2 • a 4 • a 3 a 4 a 5 The only way to achieve scalability is to “organize” agents into a hierarchical structure. We can then use this structure in allocating agents (teams, coalitions, etc. ) to incoming tasks. 38

Now the organization in action… Task T 2 (200, 400) arrives at a 5 a 1 Task T 1 (100, 50) arrives at a 3 a 5 a 0 a 3 39

The Need for Search a 1 a 5 failed to form the coalition and sending task up to a 0 a 6’s schedule a 6 Can we learn a policy for deciding how to a 3’s schedule search at an agent a 3 successfully formed based on meta-level a coalition for T 1. information on a 3 resources at children a 7 agents. a 7’s schedule 40

Elements of an Organization • Organization structure • Decision making • Information abstraction • Goal decomposition 41

The Local Decision Problem • Each manager has the following state: – For each sub-cluster, its size, average resources, and standard deviation • and is required to make an action: – serial: select the best candidate agent for asking for resources – parallel: decompose the required resources over the sub-managers • Both versions of the decision problem can be modeled as an MDP, where RL can be used. – Q learning algorithm with neural nets as functional approximators 42

Experiments Setup We tested two organizations: 1. A small organization, consisting of 40 individual agents, managed by 10 managers. 2. A larger organization consisting of 90 individual agents, managed by 13 managers. • Tasks were chosen randomly from a fixed pool (one pool for each organization). • We compared learned policies (different exploration rates) against random and heuristic policies. We measured two quantities: – – the average utility achieved by the organization the average number of messages exchanged by the organization. 43

Results for Small Organization 44

Results for Small Organization 45

Results for Large Organization 46

Results for Large Organization 47

Conclusion on Organization for Distributed Coalition Brokering • The learned policies outperformed both random and heuristic policies for both the small and large organizations, achieving higher utility with less communication. – Less exploration seems to better due to interaction among learners • Though using neural nets threatens policy convergence, our learned policies always converged. Abstraction and decomposition functions highly affects convergence. • We expect more improvement in the performance of the learned policy with better abstraction and decomposition functions. • Our next step is to study the optimization of the organizational structure and how this interacts with optimizing the decision making (different organizations have different optimal policies). 48

Farm Distributed, generic multi-agent simulation environment • Provides – Environmental state accessors – Controllable communication mechanism – Plug-in mechanism for adding functionality • Agents run in “allocated” real time. – Each agent receives an amount of real CPU time to run in 49

System Architecture Analyses • State / trend analysis Farm Core Driver • Non-agent activity Meta-Agent • Thread scheduling • Communication Agent • • Agent … Agent GUIs • Plug-in management • Control flow Agent … Agent • State visualization … Meta-Agent • Thread scheduling • Communication Agent … Agent Each component may be run on a separate processor Most components are optional, and may be added dynamically 50

Global Data • Allows global data “properties” to disseminate information – Environmental simulation • (e. g. target location, visibility lists) – Statistics and instrumentation • (e. g. current utility, message totals) • Data flows among components – Readers: Analysis, agents, visualization. . – Writers: Environmental drivers, agents… 51

Global Data Bottleneck • Central storage of these properties is impractical – Thousands of agents may cause millions of accesses – Creates a potential bottleneck, as well as high communication overhead • Distribute data across components – Farm core tracks ownership of properties – Storage and access control is distributed – Data may be proactively pushed 52

Distributed Storage • Pattern of ownership matters – Co-location of data and usage is desirable – 11 strategies were evaluated across 3 domains – Compared simulation duration and message overhead • Examples: – FRP: First-reader owner, subsequent readers added to push list – S: Offline learned, max-access owner and push list contains net gain plug-ins (reads-writes > 0) 53

Results • ANTs/DSN domain: – S performed most consistently over the 3 domains – FRP is (mostly) reasonable, lower overhead choice – Simulation duration improvements vs. centralized: Strategy / Domain DSN Graph Color Learning FRP 7. 5% 21. 6% 0. 7% S 4. 8% 28. 5% 44. 4% 54

Organization Design and Instantiation • Develop automated organization design & instantiation capabilities – Domain independent approach • evaluation includes models for EW Challenge-like domain • range from small, simple organizations (1 s-10 s of agents) to large, complex organizations (1000 s of agents) – Ability to generate automatically appropriate organizational structure • based on performance requirements & task-environment expectations • e. g. , change from simple peer-to-peer, to single-level hierarchy, to multi-level hierarchy as scale/requirements change 55

Automating Organization Design and Instantiation Domain Organization Model Organization Designer Instantiator Candidate Evaluator Organizations Performance Requirements Evaluations Coordination Knowledge Resources/ Agent Capabilities Domain & Coordination Activity Models Env Model & Requirements Task Environment Analysis Models Org Design Knowledge & Search Strategies Role/Agent Bindings Operational Simulator Scenario Generator Abstract-Task Executer Detailed Task-Level Evaluation 56

Designer Input Monitor • Problem-Domain Goal Tree Detect • Task Environment Scan coverage. Area (0 0 1000) max. Simultaneous. Tracks 60 max. Arrival. Rate 5/min expected. Track. Distribution : uniform max. Velocity 20 Handle Track Fuse Update Verify . . . • Performance Requirements max. Detect. Delay 4 tracking. Resolution 8 tradeoff. Weights: Detection. 4 Tracking. 6 evaluation. Weights: Communication. 6 Computation. 1 Scanning. 3 57

Designer Input • Agents/Resources Agent: 1 Location: (40 90) Role: Radar Scanner Focused Radar Fuser Manager Subordinate. . . Cap: radar. Radius 20 scan. Rate. 25/sector focused. Scan. 75 compute. Power 1. 6 memory 1. 2 storage 20 com. Method broadcast-1 com. Range 600 com. Rate 15. . . Agent: 2 • Roles Radar. Scanner Achieves: scan Requirements: area coverage. Area rate fn(max. Detect. Delay, max. Arrival. Rate) com fn(max. Arrival. Rate) Decomposition. Method fn() Linear. Sweep. Scanner Achieves: scan Requirements: area coverage. Area sweep. Speed fn(max. Detect. Delay, max. Arrival. Rate) com fn(max. Arrival. Rate) Decomposition. Method fn() . . . Com. Method: broadcast-1 bandwidth 240 max. Effective. Load. 7 . . . 58

Designer Output • Role-Goal-Agent Bindings Agent-1 Radar. Scanner area (20 40 60 80) scan. Interval 3 send. To {Agent-34} rec. From {} Focused. Scanner send. To Fuser rec. From Fuser tradeoff. Weights: Radar. Scanner. 4 Focused. Scanner. 6 Agent-2 Radar. Scanner area (60 80 100 120) scan. Interval 3 send. To {Agent-34} rec. From {} Focused. Scanner send. To Fuser rec. From Fuser tradeoff. Weights: Radar. Scanner. 4 Focused. Scanner. 6 . . . 59

Automating Organizational Design and Instantiation Problem Domain Description Stable for a variety of contexts Goal Tree Roles Environmental Model Performance Requirements Agent Set Variable/Context specific Design & Instantiation Process Coordination Knowledge Role-Goal-Agent Bindings The organizational structure: Includes both problem domain and coordination domain bindings 60

Problem-Domain Knowledge • Automated designer is provided: – An organizational goal tree parameterized by an environmental model and performance requirements – A set of roles that could be used to satisfy the organizational leafgoals • For each role a requirement function dependent on a goal and its parameters – A set of agents and a capability list for each area dimensions max # vehicles, vehicle speed, EM distribution etc. PR max. delay, min. res. Monitor max delay Detect Scanner Verifier Track Fuser min. res. … (e. g. scan rate, scan radius, communication resources, CPU resources, …) A 1, A 2, …, An 61

Problem-Domain Bindings • To each leaf goal bind a role that satisfies it • For each role-goal binding, bind a set of agents that together satisfy the requirements Scan Fuser Scanner Verifier A 1, A 2, …, Ai, Aj, Ak …, An Fuser Scanner Verifier Fuse 62

Coordination Domain Scan Radar Ai, …, Aj Fuser CG CG The above results in a set of agent bindings to roles and goals. Alone this is not enough to guarantee that the organizational goals are satisfied. The agents’ behavior must be coordinated Each set of role-goal-agent bindings generates a Coordination Goal when needed Ak, …, Al 63

Coordination Goals • To satisfy a coordination goal, the design and instantiation process uses domainindependent coordination knowledge – Chooses coordination mechanisms and roles – Assigns coordination role-goal-agent bindings • Coordination bindings may themselves generate coordination goals 64

Coordination Goals (cont. ) • Which mechanism to use depends on the character of the role-goal-agent bindings. Examples: – A simple goal satisfied by a small number of agents may require only peer-to-peer coordination – A goal such as scanning for new vehicles satisfied by many agents requires a single-level hierarchy to assign scan schedules – A goal of tracking multiple vehicles may generate resource contention, thus requiring a coordination mechanism such as SPAM 65

Teams • Many goals can be satisfied with long-term organizational structures • Other goals are more transient and may be better satisfied with teams – Temporary “organization structures” established to satisfy particular goals and disbanded when the goals no longer hold • e. g. , Tracking a particular vehicle – Formed in response to dynamically generated subgoals – Thus, not strictly organizational structures • Rather than generate teams, the design and instantiation process must ensure that organizational structures and resources exist for agents to generate and participate in teams as needed 66

Implementation • We are building an automated organizational design and instantiation system • We frame the process of organizational design and instantiation as a search process – Use heuristics to generate a reasonable set of complete role-goal-agent bindings – Evaluate each organization (set of bindings) • described shortly • Eventually want to be able to perform early evaluation of partial bindings during generation of the candidate sets 67

Implementation Role-goal bindings Role-goal-agent bindings Domain + coordination bindings a c b Heuristics guide the search to a set {a, b, c} of reasonable organizations • Roles multiplexed within agents • Communication costs kept low • Computational load balanced a b c Org. Evaluation a highest rated organization 68

Organization Evaluation Goals: • Determining performance evaluation values for MAS structures • Mainly, comparison of different structures given the same environment – no absolute evaluation (specific performance numbers) – rather comparable performance-estimator values • Predicting the performance of the system regarding the given circumstances 69

Evaluating an Organization EM PR Goals Expected. Goal Satisfaction Agent 1: Cap. List, {B 1, B 2, …, Bm} Agent 2: Cap. List, {B 1, B 2, …, Bq} … Agentn: Cap. List, {B 1, B 2, …, Br} B i= Role-Goal Com. To Com. From Exec. Time Exec. Period Com. Time Org. Evaluation Agent Loading/ Performance Scalar Evaluation 70

Evaluation • Load calculation for the agents, based on expected number of task occurrences – Goal-specific load – Total load of an agent • Delays computed based on work- and communication load – computed from expected task occurrences • Performance estimation based on delayed tasks – decreased task utility • Response time estimation – Based on processor and communication load 74

Contributions • Separation of problem-domain from organizational coordination • Domain-independent coordination knowledge is part of the automated design system – Developer need not pre-specify organizational relationships, only problem-domain-specific goals and roles • Provide a framework for representing organizational activities and relationships • Developing algorithms for effectively searching and evaluating organization design space 76

Lessons Learnt • RADSIM and Debugging/Analysis Tools key to success of effort on EW Challenge Problem - would have put more manpower earlier in project to make them more flexible and efficient tools, and accurate representations of hardware • We needed to build separate simulation systems because could not scale RADSIM • Could not explore different communication and processing assumptions • • Extension of Oracle procedure of University of South Carolina to multiple targets – Would have allowed us to better understand the effectiveness of our resource allocation protocols EW problem did not stress limits on computational resources only on communication and sensors – Dramatic limits on communication dominated design issues too much and thus narrowed scope of issues/solutions pursued – Sensors Processing/Fusion not a time-consuming process nor were there any realistic options about trading off time for accuracy EW Challenge Problem was sufficiently interesting and relevant to lead us to very interesting and important results 81

Deliverables • SPAM: Soft real-time distributed resource allocation protocol – Transferred to Zhang and Selman • Farm: Distributed simulator for large-scale multiagent systems – available via web • SRTA: Soft real-time local agent control – Transferred to Honeywell, also available via web • APO: Mediation-based distributed constraint satisfaction protocol 83

Technology Transition/Transfer : • New Book -- Distributed Sensor Networks – Edited by V. Lesser, M. Tambe and C. Ortiz – Kluwer Academic Publishers • Part of a book series on Multiagent Systems, Artificial Societies and Simulated Organizations – Contains articles from ANT’s researchers on • EW Challenge Problem Testbed • Distributed Resource Allocation: Architectures and Protocols • Formal Analysis of Protocols 84

Technology Transition/Transfer • Rockwell/Collins considering SRTA architecture components and SPAM/APO for DARPA HURT • Boeing investigating the potential of using SPAM/APO for the new DISA GIG project and for air space deconfliction for future JDAM projects. • Honeywell Laboratories has licensed the TÆMS/ SRTA technologies for research use. – Used in First-Responder Application • Real-time Tornado Tracking using Network of Phase. Arrayed Radars as part of NSF ERC – SPAM type real-time resource allocation – Initial deployment in 2006 in Oklahoma 85

Project Successes • APO -- Best performing distributed constraint satisfaction algorithm – Measured against currently best known distrtibuted constraint satisfaction algorithm (AWC) • SPAM -- close to optimal in small experiments, scales well to large number of sensors and vehicles, resource and time adaptive protocol – Experimentally validated in RADSIM – Large Scale Experiments in FARM – Used in EW Challenge Problem hardware • SRTA -- demonstrated a sophisticated agent arhcitecture with planning and scheduling technologies is appropraiate for use in a soft-real time application – Extensive experimentation in RADSIM – Used in EW Challenge Problem hardware 87

Project Successes (cont. ) • Organization Structuring -- demonstrated advantages of using an organization approach for distributed resource allocation; show ed that these advantages grow with scale; demonstrated ability to create and adapt organization on the fly. – Extensive experimentation in RADSIM – Used in EW Challenge Problem hardware • Extensive Publication – 1 book, 6 book chapters, 3 journal articles + 5 in preparation, 18 highly selective conference papers, 16 workshop papers • Received Honorable Mention in FIPA Software Prototypes Track Demonstration Competition for Distributed Sensor Network Application 88

Cumulative List of Publications Books/Book Chapters • Egyed, Alexander; Horling, Bryan; Becker, Raphen; and Balzer, Robert (2003). “Visualization and Debugging Tools. ” Distributed Sensor Nets: A Multiagent Perspective. V. Lesser, C. Ortiz, and M. Tambe (editors), Kluwer Academic Publishers, pp. 33– 41. • Horling, Bryan; Mailler, Roger; Shen, Jiaying; Vincent, Regis; and Lesser, Victor (2003). “Using Autonomy, Organizational Design and Negotiation in a Distributed Sensor Network. ” Distributed Sensor Nets: A Multiagent Perspective. V. Lesser, C. Ortiz, and M. Tambe (editors), Kluwer Academic Publishers, pp. 139– 183. • Lesser, V. ; Ortiz, C. ; Tambe, M. (editors). Distributed Sensor Networks: A Multiagent Perspective. In Series: Multiagent Systems, Artificial Societies, and Simulated Organizations, Volume 9, May 2003. • Wang, Guandong; Zhang, Weixiong; Mailler, Roger; and Lesser, Victor. (2003). “Analysis of Negotiation Protocols by Distributed Search. ” Distributed Sensor Nets: A Multiagent Perspective. V. Lesser, C. Ortiz, and M. Tambe (editors), Kluwer Academic Publishers, pp. 339– 361. • Horling, B. ; Mailler, R. ; Lesser, V. (To appear. ) “Farm: A Scalable Environment for Multi-Agent Development and Evaluation. ” Advances in Software Engineering for Multi-Agent Systems. 89

Publications in Journals and Highly Refereed Conferences • Lesser, V. ; Decker, K. ; Wagner, T. ; Carver, N. ; Garvey, A. ; Horling, B. ; Neiman, D. ; Podorozhny, R. ; Nagendra. Prasad, M. ; Raja, A. ; Vincent, R. ; Xuan, P. ; Zhang, X. Q. (To appear). “Evolution of the GPGP/TAEMS Domain-Independent Coordination Framework. ” Autonomous Agents and Multi-Agent Systems, Kluwer Academic Publishers. • Zhang, Xiao. Qin; Lesser, Victor; Podorozhny, Rodion (To appear). “Multi-Dimensional, Multi. Step Negotiation for Task Allocation in a Cooperative System. ” Autonomous Agents and Multi. Agent Systems (conditionally accepted for publication). • Mailler, Roger; Lesser, Victor; and Horling, Bryan (2003). “Cooperative Negotiation for Soft Real-Time Distributed Resource Allocation. ” Proceedings of Second International Joint Conference on Autonomous Agents and Multi. Agent Systems (AAMAS 2003), Melbourne, Australia, ACM Press, pp. 576– 583. • Sims, Mark; Goldman, Claudia; and Lesser, Victor (2003). “Self-Organization through Bottom Coalition Formation. ” Proceedings of Second International Joint Conference on Autonomous Agents and Multi. Agent Systems (AAMAS 2003), ACM Press, Melbourne, Australia, pp. 867– 874. • Xiang, Yang; Lesser, Victor. (2003). “On the Role of Multiply Sectioned Bayesian Networks for Cooperative Multiagent Systems. ” IEEE Transactions on Systems, Man, and Cybernetics, Part A, Vol. 33(4): 489– 501. 90

Publications in Journals and Highly Refereed Conferences, Continued • Zhang, X. Q. ; Lesser, V. R. ; Wagner, T. (2003). “Integrative Negotiation in Complex Organizational Agent Systems. ” In Proceedings of the 2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT 2003), pp. 140– 146. • Zhang, X. Q. ; Lesser, V. R. ; Wagner, T. (2003). “A Two-Level Negotiation Framework for Complex Negotiations. ” In Proceedings of the 2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT 2003), pp. 311– 317. • Zhang, XQ; Lesser, V. ; and Abdallah, S. (2003). “Efficient Ordering and Parameterization of Multi-Linked Negotiation. ” In Proceedings of Second International Joint Conference on Autonomous Agents and Multiagent Systems. (Extended abstract), ACM Press, pp. 11701171. Full version available as University of Massachusetts Computer Science Technical Report #02 -42. • Zhang, XQ; Lesser, V. ; and Wagner, T. (2003). “A Multi-Leveled Negotiation Framework. ” In Proceedings of Second International Joint Conference on Autonomous Agents and Multiagent Systems. (Extended abstract), ACM Press, pp. 1172 -1173. Full version available as University of Massachusetts Computer Science Technical Report #02 -44. • Horling, B. ; Neiman, D. ; Podorozhny, R. ; Nagendra. Prasad, M. ; Raja, A. ; Vincent, R. ; Xuan, P. ; Zhang, X. Q. (2002). “Evolution of the GPGP/TAEMS Domain-Independent Coordination Framework. ” (Plenary Lecture/Extended Abstract). Proceedings of the 1 st International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS’ 02), pp. 1 -2. (Also available in the full version as University of Massachusetts Computer Science Technical Report 02 -03. ) 91

Journals and Highly Refereed Conferences, Continued • Lesser, V. (2002). “Evolution of the GPGP/TÆMS Domain Independent Coordination Framework, ” (Plenary lecture/Extended abstract. ) Proceedings of the 1 st International Conference on Autonomous Agents and Multi-Agent Systems, Part 1. ACM Press, pp. 1– 2. • Xuan, P. and Lesser, V. (2002). “Multi-Agent Policies: From Centralized Ones to Decentralized Ones. ” Proceedings of the 1 st International Joint Conference on Autonomous Agents and Multiagent Systems, Part 3: pp. 1098 -1105. ACM Press. • Zhang, X. Q. ; Lesser, V. ; Wagner, T. (2002). “Integrative Negotiation in Complex Organizational Agent Systems. ” (Extended abstract. ) In Proceedings of the 1 st International Joint Conference on Autonomous Agents and Multiagent Systems, Part 1: pp. 503– 504. ACM Press. • Zhang, X. Q. and Lesser, V. (2002). “Multi-Linked Negotiation in Multi-Agent Systems. ” Proceedings of the 1 st International Joint Conference on Autonomous Agents and Multiagent Systems, Part 3: pp. 1207 -1214. ACM Press. • Horling, B. ; Benyo, B. ; Lesser, V. (2001). “Using Self-Diagnosis to Adapt Organizational Structures. ” Proceedings of the Fifth International Conference on Autonomous Agents (Agents 2001), Montreal, ACM Press, pp. 529– 536. • Horling, B. ; Vincent, R. ; Mailler, R. ; Shen, J. ; Becker, R. ; Rawlins, K. ; Lesser, V. (2001). “Distributed Sensor Network for Real-Time Tracking. ” Proceedings of the Fifth International Conference on Autonomous Agents (Agents 2001), Montreal, ACM Press, pp. 417 -424. • Raja, A. ; Wagner, T. ; Lesser, V. (2001). “Reasoning about Uncertainty in Agent Control. ” In Proceedings of the Fifth International Conference on Information Systems, Analysis, and Synthesis, Computer Science and Engineering: Part 1, Volume VII, pp. 156 -161, Orlando, FL. Received Best Paper Award for session on Mathematical Methods & Optimization in Problem Solving Systems II. 92

Highly Refereed Conferences, continued • Vincent, R. ; Horling, B. ; Lesser, V. ; Wagner, T. (2001). “Implementing Soft Real-Time Agent Control. ” In Proceedings of the Fifth International Conference on Autonomous Agents (Agents 2001), Montreal, ACM Press, June 2001, pp. 355– 362. (Honorable Mention in FIPA Software Prototypes Track Demonstration Competition. ) • Wagner, T. ; Lesser, V. (2001). “Evolving Real-Time Local Agent Control for Large-Scale Multi -Agent Systems. ” (Extended abstract) In Proceedings of the Fifth International Conference on Autonomous Agents, Montreal: ACM Press, pp. 17– 18. • Xuan, P. ; Lesser, V. ; Zilberstein, S. (2001). “Communication Decisions in Multi-agent Cooperation: Model and Experiments. ” In Proceedings of the Fifth International Conference on Autonomous Agents (Agents 2001), Montreal, ACM Press, pp. 616– 623. • Raja, A. ; Lesser, V. ; Wagner, T. (2000). “Toward Robust Agent Control in Open Environments. ” In Proceedings of the Fourth International Conference on Autonomous Agents (AA 2000), Barcelona, pp. 84– 91. • Xiang, Y; Lesser, V. (2000). “Justifying Multiply Sectioned Bayesean Networks. ” In Proceedings of the Fourth International Conference on Multi-Agent Systems (ICMAS), Boston, pp. 349– 356. • Xiang, Y; Lesser, V. (2000). “A Constructive Bayesian Approach for Vehicle Monitoring. ” In Proceedings of the Third International Conference on Information Fusion (Fusion 2000). Vol. 2, pp. 14– 21, Paris. • Zhang, X. Q. ; Podorozhny, R. ; Lesser, V. (2000). “Cooperative, Multi. Step Negotiation Over a Multi-Dimensional Utility Function. ” In Proceedings of the IASTED International Conference, Artificial Intelligence and Soft Computing (ASC), Banff, IASTED/ACTA Press, pp. 136– 142. 93

Workshops, Symposia, Collected Volumes & Technical Reports • Horling, Bryan; Mailler, Roger; Sims, Mark; and Lesser, Victor (2003). “Using and Maintaining Organization in a Large-Scale Distributed Sensor Network. ” In Proceedings of the Workshop on Autonomy, Delegation, and Control (AAMAS 03), Melbourne, Australia, July. • Horling, Bryan; Mailler, Roger; and Lesser, Victor. (2003) “Farm: A Scalable Environment for Multi-Agent Development and Evaluation. ” Proceedings of Second International Workshop on Software Engineering for Large-Scale Multi-Agent Systems (SELMAS), pp. 171– 177. • Mailler, Roger; and Lesser, Victor. (2003) “A Mediation-Based Protocol for Distributed Constraint Satisfaction. ” In The Fourth International Workshop on Distributed Constraint Reasoning, Acapulco, Mexico, pp. 49 -58. • Horling, Bryan; Lesser, Victor; Vincent, Regis; Wagner, Thomas (2002). “The Soft Real-Time Agent Control Architecture. ” In Proceedings of the AAAI/KDD/UAI-2002 Joint Workshop on Real-Time Decision Support and Diagnosis Systems. Technical Report WS-02 -15, pp. 54 -65. (Also available as University of Massachusetts Computer Science Technical Report 02 -14. ) • Lesser, V. ; Decker, K. ; Wagner, T. ; Carver, N. ; Garvey, A. ; Raja, A. ; Lesser, V. (2002). “Meta -Level Control in Multi-Agent Systems. ” In Proceedings of AAAI/KDD/UAI-2002 Joint Workshop on Real-Time Decision Support and Diagnosis Systems, Technical Report WS-0215, pp. 47 -53. (Also available as University of Massachusetts Computer Science Technical Report 01 -49. ) • Mailler, R. , Vincent, R. , Lesser, V. , Middlekoop, T. , and Shen, J. (2001). “Soft-Real Time, Cooperative Negotiation for Distributed Resource Allocation. ” In Proceedings of the AAAI Fall Symposium on Negotiation Methods for Autonomous Cooperative Systems, Falmouth, MA. 94

Workshops, Symposia, Collected Volumes & Technical Reports, Continued • Raja, A. ; Lesser, V. (2001). “Towards Bounded-Rationality in Multi-Agent Systems. ” In University of Massachusetts Computer Science Technical Report 01 -34. • Vincent, R. ; Horling, B. ; Lesser, V. (2001). “An Agent Infrastructure to Build and Evaluate Multi-Agent Systems: The Java Agent Framework and Multi-Agent System Simulator. ” Lecture Notes in Artificial Intelligence 1887: Infrastructure for Agents, Multi-Agent Systems, and Scalable Multi-Agent Systems. Wagner & Rana (eds. ), Springer, pp. 102– 127. • Zhang, X. Q. ; Lesser, V. ; Podorozhny, P. (2001). “New Results on Cooperative, Multi. Step Negotiation Over a Multi-Dimensional Utility Function. ” Proceedings of the AAAI Fall Symposium on Negotiation Methods for Autonomous Cooperative Systems. • Zhang, X. Q. ; Lesser, V. ; Wagner, T. (2001). “A Proposed Approach to Sophisticated Negotiation. ” Proceedings of the AAAI Fall Symposium on Negotiation Methods for Autonomous Cooperative Systems. • Horling, B. ; Benyo, B. ; Lesser, V. (2000). “Using Self-Diagnosis to Adapt Organizational Structures. ” (Extended abstract) In The Fourth International Conference on Multi. Agent Systems (ICMAS), Boston, MA: IEEE Computer Society, pp. 397 -398. Also available as University of Massachusetts/Amherst Computer Science Technical Report #1999 -64. • Raja, A. ; Wagner, T. ; Lesser, V. (2000). “Reasoning about Uncertainty in Design-to. Criteria Scheduling. ” Proceedings of AAAI 2000 Spring Symposium on Real-Time Autonomous Systems, pp. 76– 83, Stanford, CA. 95

Workshops, Symposia, Collected Volumes & Technical Reports, Continued • Wagner, T. ; Benyo, B. ; Lesser, V. ; Xuan, P. (2000). “Investigating Interactions Between Agent Conversations and Agent Control Components. ” Issues in Agent Communication, Vol. 1916, F. Dignum & M. Greaves (eds. ), Berlin: Springer-Verlag, pp. 314– 331. • Wagner, Thomas (2000). “Toward Quantified Control for Organizationally Situated Agents. ” University of Massachusetts/Amherst, Department of Computer Science, Ph. D. Thesis, February. • Wagner, T. ; Lesser, V. (2000). “Design-to-Criteria Scheduling: Real-Time Agent Control. ” Proceedings of AAAI 2000 Spring Symposium on Real-Time Autonomous Systems, pp. 89– 96. Also available as University of Massachusetts/ Amherst Computer Science Technical Report #1999 -58. • Wagner, T; Lesser, V. (2000). “State-based Control for Organizationally Situated Agents. ” (Extended abstract) In Proceedings of the Fourth International Conference on Multi-Agent Systems (ICMAS), Boston, MA: AAAI Press, pp. 457 -458. Also available as University of Massachusetts/Amherst Computer Science Technical Report #1999 -68. • Xuan, P. ; Lesser, V. ; Zilberstein, S. (2000). “Communication in Multi-Agent Markov Decision Processes. ” (Extended abstract) In Proceedings of the Fourth International Conference on Multi-Agent Systems (ICMAS), Boston, MA: AAAI Press, pp. 467 -468. 96