diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/.gitignore" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/.gitignore"
new file mode 100644
index 0000000000000000000000000000000000000000..dc683abc101a122659930791875ec01a828c0a98
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/.gitignore"
@@ -0,0 +1,20 @@
+# Python
+**/__pycache__/
+*.pyc
+*.pyo
+*.pyd
+.Python
+env/
+venv/
+**/*_venv/
+
+# IDE
+.vscode/
+.vscode-server/
+.cursor/
+.cursor-server/
+.idea/
+
+# special
+**/results/
+**/tmp/
\ No newline at end of file
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/README.md" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/README.md"
new file mode 100644
index 0000000000000000000000000000000000000000..428df6008b990878642f8a93d7b68698c1d34a7b
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/README.md"
@@ -0,0 +1,316 @@
+# Acceptance Report
+## 1. Project Overview
+### 1.1 Background
+The main challenges OpenCloudOS 9 faces while building out its AI ecosystem:
+- The AI software stack is huge, and verifying every package by hand is slow and labor-intensive
+- Some upstream projects target only Ubuntu and other distributions and do not work on OpenCloudOS 9
+- There is no standardized verification process or automation tooling
+
+### 1.2 Goals
+Build an intelligent, automated verification tool for AI software that can:
+- Analyze a package's documentation automatically to determine how to install it
+- Generate test cases automatically
+- Verify the compatibility of Python packages in bulk
+- Standardize the test framework and output format
+
+## 2. Approach
+Based on a literature survey and on the PyPI package list provided by the task committee, we found that installation and test failures fall into the following categories:
+- Dependency conflicts: different packages may require different versions of the same library, and some packages simply cannot coexist in one environment.
+- Missing system environment: some packages depend on the system environment. For example, GPU-dependent software needs the corresponding drivers and runtimes such as `OpenGL`, `OpenCL`, or `CUDA`; if these are missing, the package errors out during verification and the verification fails.
+- Package-name mismatch: the name used with pip may differ from the name used at import time, so the import fails. For example, `opencv-python` is installed with `pip install opencv-python` but imported with `import cv2`.
+- Long-unmaintained software: for example, `gnes` has not been maintained for three years and still uses the legacy `setup.py bdist_wheel` flow and outdated Cython syntax, so it can no longer be built and installed.
+- Broken verification code: for example, the LLM generated `import mmdnn; print(mmdnn.__version__)` as the verification code for the `mmdnn` package, but `mmdnn` has no `__version__` attribute, so the test errors out.
+
+To address these problems, combined with today's popular LLMs and MCP tooling, the project settled on the following approach:
+- Verification has to be automated, yet every package is verified differently and no universal recipe exists, so we lean on MCP tooling and let it generate the install commands and verification code automatically. Some packages are far too complex for detailed functional testing (`pandas` alone declares as many as 60 optional extras), so the generated verification code covers only core functionality: an import test, a basic functionality test, and a GPU usage test.
+- For dependency conflicts, first use MCP tools to analyze inter-package dependencies and derive a topological order, then install packages in that order; additionally, when an installation fails because of a dependency problem, create a fresh virtual environment and reinstall the package there.
+- For missing system environments, we combine two strategies: `"check before verification"` and `"resolve on error"`.
+    + For the `"check before verification"` strategy we first need to understand how a Python package is installed. When a package is built and published to PyPI, its shared libraries and source code are bundled into a `wheel`; after download, the `wheel` is verified, unpacked, and copied into the `site-packages` directory; the `pip` manager then generates a `<package>-<version>.dist-info` directory from these files to manage the metadata. The file worth noting there is `top_level.txt`, because it records the package's top-level module names (the names used at `import` time), which can be used to solve the name-mismatch problem. Finally, entry points are registered, dependencies are checked, and the file list is recorded.
+    + By checking whether a downloaded package's shared objects link correctly against the system environment, we can detect and resolve a missing environment ahead of time. After downloading a package, we look up its shared libraries under `site-packages` and use the `ldd` command to see whether each one resolves against the dependency libraries in the system environment; any missing dependency is then analyzed and fixed with the help of the LLM.
+    + The `"resolve on error"` strategy splits into install-time errors and test-time errors: on an error, we collect the system environment information, the command that was used, and the error output, pass all of it to the MCP tooling, and let it analyze and resolve the problem automatically.
+- For package-name mismatches, combine `top_level.txt` analysis with LLM analysis to find the correct import name.
+- For broken verification code, the same `"resolve on error"` path applies: give the LLM the verification code and the error output, and it rewrites and re-verifies the code until it succeeds.
+
+## 3. System Design
+### 3.1 Execution Flow
+![System execution flow](assets/AI软件自动验证工具.png)
+- The AI agent and MCP tooling are optional; enabling them makes the system install and verify packages more intelligently.
+- The components are described in the sections below.
+
+### 3.2 Collecting PyPI Packages from GitHub Repositories
+#### 3.2.1 Collecting repository information from GitHub
+- Filter GitHub repositories by a user-supplied topic and record the matching repositories' URLs for later analysis.
+- To ensure the collected repositories are influential, only repositories with more than 1000 stars are kept.
+
+#### 3.2.2 Extracting PyPI packages
+- Check the repository's primary language; if it is Python, scan its `README.md` for `pip install` commands, and record any PyPI package names found there.
+
+### 3.3 Package Analysis
+Use MCP tools to complete the following tasks automatically:
+- Check whether the package name exists on PyPI; if it does, continue with the analysis below.
+- Publishers upload dependency information and environment requirements together with the package, so we fetch the dependency list (including version constraints) from PyPI to support later analysis. This metadata is not always accurate: the package `accountant-0.0.6` declares a dependency `enum>=1.1.5`, yet the newest `enum` release on PyPI is 0.4.7, so the declared information cannot be right. We therefore gather more sources to get as close to the truth as possible: publishers usually include the project's GitHub URL in the PyPI metadata, so we also read `README.md`, `requirements.txt`, and similar files from GitHub, and we use the LLM's own knowledge to further refine the dependency list. Combining these sources yields a reasonably accurate dependency list.
+- Some AI packages need a GPU to run, so we likewise check the PyPI metadata and the GitHub repository for GPU requirements and let the LLM confirm whether the package depends on a GPU.
+- The project must verify that a package actually runs, so each package gets three tests: an import test, a basic functionality test, and a GPU usage test. There is no universal template for this test code, so we let the LLM generate the test code and the corresponding expected results from its own knowledge.
+
+### 3.4 Package Installation and Verification
+#### 3.4.1 Installation
+- To avoid polluting the local Python environment, installation and verification run inside virtual environments created with venv.
+- From the collected package information we know each package's dependencies, so before a batch install we derive an installation order with a graph algorithm. This not only lowers the chance of install failures (a package's dependencies are already installed before the package itself), but also lets us compare a package's dependency version constraints against the versions already installed, spotting potential version conflicts before installing.
+- The project picks a virtual environment whose installed versions do not conflict with the constraints of the package about to be installed; if no such environment exists, it creates a new one, then installs the package with pip or uv.
+- After installation we run the `"check before verification"` step. PyPI packages are usually shipped as `wheel`s that bundle the shared libraries the software needs, so we search the package's shared objects under `site-packages` and use the `ldd` command to check whether they link correctly against the libraries in the environment; any unresolved library is handed to the LLM to analyze and fix (for example, by installing the missing dependency through the system package manager). A sketch of this check follows this list.
+- Locating those shared objects can itself go wrong, because the package directory under `site-packages` uses the import name, which may differ from the install name. We solve this with the `top_level.txt` file inside the `<package>-<version>.dist-info` directory, which gives the correct import name and thus the right location of the package's shared libraries.
+- When installation fails, the system records the failure information.
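+
+As a concrete picture of this pre-check, the sketch below reads the import names from `top_level.txt` and runs `ldd` over the package's shared objects. The helper names, the site-packages path, and the underscore filter are illustrative assumptions, not the project's actual implementation (the project delegates the choice of public names to the LLM):
+```python
+# Illustrative sketch of the "check before verification" linkage scan.
+import subprocess
+from pathlib import Path
+
+def top_level_modules(site_packages: Path, dist_name: str) -> list[str]:
+    """Read import names from <package>-<version>.dist-info/top_level.txt."""
+    for dist_info in site_packages.glob(f"{dist_name.replace('-', '_')}-*.dist-info"):
+        top = dist_info / "top_level.txt"
+        if top.exists():
+            # Crude filter: drop names like "_yaml"; the project asks an LLM instead.
+            return [m for m in top.read_text().split() if not m.startswith("_")]
+    return []
+
+def unresolved_libs(site_packages: Path, module: str) -> dict[str, list[str]]:
+    """Run ldd over every .so under the module and collect 'not found' entries."""
+    missing: dict[str, list[str]] = {}
+    for so in (site_packages / module).rglob("*.so*"):
+        out = subprocess.run(["ldd", str(so)], capture_output=True, text=True).stdout
+        bad = [line.split()[0] for line in out.splitlines() if "not found" in line]
+        if bad:
+            missing[str(so)] = bad
+    return missing
+
+sp = Path(".venv/lib/python3.11/site-packages")  # example path, adjust to the venv
+for mod in top_level_modules(sp, "opencv-python"):
+    print(mod, unresolved_libs(sp, mod))
+```
+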
+#### 3.4.2 Verification
+- Once a package is installed and passes the environment check, the verification code is executed; verification consists of import verification, basic functionality verification, and GPU usage verification.
+- Import verification: the install name may differ from the import name, and LLM hallucination can make the generated code import the wrong name and fail the test (again, `opencv-python` is installed with `pip install opencv-python` but imported with `import cv2`). Unfortunately, the correct import name is only known after installation, by reading the package's `top_level.txt`, and some of the names there are internal interfaces: `pyyaml`'s `top_level.txt` lists the two top-level modules `yaml` and `_yaml`, but `_yaml` is an internal API that should not be imported directly. Based on these observations, we first read the top-level module names from the installed package, then hand them to the LLM, which decides which of them the import test should use.
+- Basic functionality verification: the LLM-generated test cases can themselves be wrong, so on a failure we pass the virtual environment, the verification code, and the result to the LLM; if it judges that faulty test code caused the failure, it generates new verification code and retests.
+- GPU usage test: if no GPU test code was generated during package analysis, the package is considered not to need a GPU; otherwise the GPU test runs. During testing we found that some packages load system shared libraries dynamically at runtime, which `ldd` cannot reveal beforehand, so we designed the `"resolve on error"` method: when a test fails, the LLM checks whether a missing shared library caused the failure, and if so installs it through the system package manager and reruns the test.
+- Result collection: first, any changes made to the verification code during this phase are synced back to the database/JSON file; second, test results and logs are collected in a standardized form and the final results are emitted.
+
+### 3.5 AI Agent and MCP
+- MCP is an open protocol that standardizes how applications provide context to LLMs, helping you build agents and complex workflows on top of LLMs.
+- MCP Servers: servers are the components that expose external data and tools. For system safety, we cannot let the model call arbitrary system tools, so each MCP server's capabilities are minimized: for example, when uninstalling PyPI packages, the LLM may only remove packages it installed itself; it may install system software through the system package manager but may not uninstall system software. To cover package analysis, installation, and verification through MCP, we provide four services: a GitHub information service, a PyPI package analysis service, a dependency analysis service, and a package testing service.
+- MCP Clients: the client is the bridge between the host and a server. It keeps a one-to-one connection with its server and initializes the server's environment (such as the Python interpreter path and system environment variables); it also exposes the MCP tools to the LLM and executes tool calls on its behalf.
+- Overall workflow: the system instructs the LLM via a `System Message` to solve the task with MCP tools; the LLM issues tool calls as JSON in an `Assistant Message`; the MCP tool returns its result as a `User Message` or `Tool Message`; after several rounds of messages, the LLM returns the final result to the system.
+
+## 4. Core Implementation
+### 4.1 Data Formats
+1. For flexibility, `package_info.py` defines the `PackageInfo` and `Package` classes (derived from pydantic's `BaseModel`) to carry data between modules; their fields are:
+```python
+class PackageInfo(BaseModel):
+    """Data model for package information"""
+    dependency: List[str] = Field(default_factory=list)  # dependencies, e.g. [numpy >=1.6.0, pandas Any]
+    import_test_code: str = ""               # import-test code
+    import_test_expected_result: str = ""    # expected import-test result
+    function_test_code: str = ""             # basic functionality test code
+    function_test_expected_result: str = ""  # expected functionality-test result
+    gpu_test_code: str = ""                  # GPU usage test code
+    gpu_test_expected_result: str = ""       # expected GPU-test result
+    verified: str = "False"                  # whether verification has completed
+```
+
+```python
+class Package(BaseModel):
+    """Complete package record"""
+    package_name: str   # package name
+    info: PackageInfo   # PackageInfo instance
+    exists: bool        # whether the package exists on PyPI
+```
+
+2. For storage, the project supports both a database and a JSON file; format conversion is encapsulated in `package_converter.py`, and database/file access in `package_repository.py`.
+
+### 4.2 Implementation: Collecting PyPI Packages from GitHub
+#### 4.2.1 Obtaining PyPI package names from GitHub for a given topic, in these steps:
+1. Query GitHub with the given topic; to keep only widely recognized repositories, select those with more than 1000 stars and record them in a .csv file with the following fields:
+- "name": repository name,
+- "url": repository URL,
+- "language": primary language,
+- "stars": star count,
+- "description": repository description,
+- "updated_at": last update date.
+2. Read the .csv file, fetch each repository's README.md from its URL, and let the LLM analyze whether it mentions PyPI packages; if so, record all of that repository's package names as a list `package_list`, e.g. `[numpy, pandas, ...]`, and write `name`, `url`, and `package_list` to a .txt file.
+
+#### 4.2.2 Generating Package records from the collected names
+1. Read the pypi.txt file to get all PyPI package names, call the `assign_task_to_llm_mcp` function, and pass the names to the MCP module as arguments.
+2. In the MCP module, a `system`-role message asks the LLM to analyze each PyPI package with the following steps:
+- call the `check_pypi_mcp` tool to check whether the name exists on PyPI; if not, the name is wrong and processing stops;
+- if the package exists, call the `find_dependency_for_pip_package_mcp` tool and combine it with the model's own knowledge to find the package's dependencies and each one's version constraints;
+- if the package exists, call the `find_gpu_requirement_for_pip_package_mcp` tool and combine it with the model's own knowledge to judge whether the package needs a GPU for full functionality;
+- if the package exists, generate the verification code and expected results from the information above, covering the import test, the core functionality test, and the GPU usage test, each with its expected result. Some outputs cannot be known before execution (for example, code that prints the current time), so expected results are written as regular expressions, and at test time a result passes if it matches its expression;
+- save the generated Package record to the database/JSON file.
+
+### 4.3 Implementation: Package Installation and Verification
+#### 4.3.1 Topological ordering
+1. Load every Package record whose `exists` field is `true` from the database/JSON file, read the `package_name` and `info.dependency` fields, and organize them as `{"package_name": ["dependency", ...]}`.
+2. Call `build_dependency_graph` to strip the version constraints from the names and build a directed acyclic graph with `dependency -> dependent` edges.
+3. Call `topological_sort` to derive the installation order from that graph; if a cycle is found during the sort, generation fails and the program exits. (A minimal sketch of steps 2-3 follows below.)
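+
+As an illustrative stand-in for `build_dependency_graph`/`topological_sort` (the project's own functions may differ), here is Kahn's algorithm over the `{"package_name": ["dependency", ...]}` shape just described; a cycle leaves nodes unprocessed and raises:
+```python
+# Kahn's algorithm: dependencies come out before the packages that need them.
+from collections import deque
+
+def topo_order(pkg_deps: dict[str, list[str]]) -> list[str]:
+    # Strip version constraints, e.g. "numpy >=1.21" -> "numpy".
+    deps = {p: {d.split()[0] for d in ds} for p, ds in pkg_deps.items()}
+    nodes = set(deps) | {d for ds in deps.values() for d in ds}
+    indegree = {n: 0 for n in nodes}
+    dependents: dict[str, list[str]] = {n: [] for n in nodes}
+    for pkg, ds in deps.items():
+        for d in ds:
+            indegree[pkg] += 1          # pkg waits on each of its dependencies
+            dependents[d].append(pkg)   # edge dependency -> dependent
+    queue = deque(n for n in nodes if indegree[n] == 0)
+    order = []
+    while queue:
+        n = queue.popleft()
+        order.append(n)
+        for m in dependents[n]:
+            indegree[m] -= 1
+            if indegree[m] == 0:
+                queue.append(m)
+    if len(order) != len(nodes):
+        raise ValueError("dependency cycle detected")  # mirrors the abort-on-cycle rule
+    return order
+
+print(topo_order({"pandas": ["numpy >=1.21", "python-dateutil Any"], "numpy": []}))
+```
+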
+#### 4.3.2 Environment preparation
+1. Following the topological order, the system installs and verifies packages one by one. The order includes some dependency-only packages that have no Package record in the database/JSON file; these are installed into the default virtual environment.
+2. For packages that do have a Package record, the system first checks whether the default virtual environment is adequate: it calls `detect_potential_version_conflicts` to compare the versions already installed in that environment against the constraints in `info.dependency`; if everything is within range, the environment qualifies. On a conflict, the system walks through all virtual environments until one qualifies or all of them fail; if all fail, it calls `create_venv` to create a new virtual environment.
+
+#### 4.3.3 Installation
+1. Having chosen the right virtual environment, the system calls `install_package` to install the package there.
+2. On success, execution continues; on failure, `assign_task_to_llm_mcp` is called to analyze and repair the failure, with the following arguments:
+- the PyPI package name,
+- the install command (pip or uv),
+- standard output (stdout),
+- error output (stderr).
+3. In the MCP module, a `system`-role message asks the LLM to resolve the install failure with the following steps:
+- Analyze the cause of the failure.
+- Use its own knowledge plus the MCP tools to fix it:
+    + if the error comes from a missing system dependency, call `detect_system_package_manager_mcp` to identify the system's package manager (dnf/yum/apt, etc.), then `install_system_package_mcp` to install the missing system package, then reinstall the PyPI package;
+    + if the error comes from a missing PyPI package, call the `install_pypi_package_mcp` tool to install it, then reinstall the original package;
+    + if the error comes from a misconfigured virtual environment, call the `create_virtual_env_mcp` tool to create a new environment and reinstall the package there.
+- Repeat until the package installs successfully or the retry limit is reached.
+- Record the result on failure; proceed to verification on success.
+
+#### 4.3.4 System environment check
+1. After a successful install, `pre_resolve_environment` is called to analyze whether the package links correctly against the system's dependency libraries. It first calls `find_dynamic_libs` to locate the package's shared objects, then `analyze_lib_with_ldd` to check whether each one links correctly against the system libraries; unresolved libraries trigger `auto_install_missing_dependencies`, after which the linkage is rechecked until everything resolves.
+2. Inside `find_dynamic_libs`, `detect_import_names` is called first: it locates the package's `top_level.txt` via the `<package>-<version>.dist-info` directory under `site-packages` and asks the LLM which of the top-level module names are the ones used at import time. With that name, `find_dynamic_libs` walks the package directory for `.so` files and inspects them with `ldd` to determine whether any linkage fails.
+3. `auto_install_missing_dependencies` installs whatever the linkage check found missing; since a missing dependency may come from a single shared object inside some package, the LLM is used to map it to the right system-package name before the install command is run.
+
+#### 4.3.5 Verification
+1. If a package has no `Package` record, it is only a dependency rather than a package to verify, so verification is skipped; otherwise the verification code is generated.
+2. Test cases and expected results (expressed as regular expressions; a sketch of this matching follows this section) come from the `Package` fields `info.import_test_code`, `info.import_test_expected_result`, `info.function_test_code`, `info.function_test_expected_result`, `info.gpu_test_code`, and `info.gpu_test_expected_result`. Some packages need to download a small model from `HuggingFace` to verify functionality, but `HuggingFace` is not directly reachable from mainland China, so the generated code prepends `import os; os.environ["HF_ENDPOINT"] = "https://hf-mirror.com"` to download the model from the domestic mirror instead.
+3. When a test errors out, the MCP module is invoked: a `system`-role message asks the LLM to resolve the test failure with the following steps:
+- Analyze the cause of the error.
+- Use its own knowledge plus the MCP tools to handle these cases:
+    + a syntax error: fix the test code directly and retest;
+    + a module-not-found error: call `detect_import_name_mcp` to find the correct import name and retest;
+    + a missing system dependency: call `detect_system_package_manager_mcp` to identify the system's package manager (dnf/yum/apt, etc.), then `install_system_package_mcp` to install the missing system package, and retest;
+    + a misconfigured virtual environment: call the `create_virtual_env_mcp` tool to create a new environment and retest there.
+- Repeat until the problem is solved or the retry limit is reached.
+4. Finally, each result is classified as `COMPATIBLE` or `INCOMPATIBLE`, meaning all tests passed or some test failed, respectively.
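+
+To make the regex convention concrete, here is a sketch of running a generated test inside its virtual environment and matching stdout against the expected pattern. `run_test`, the venv path, and the timeout are hypothetical; only the regex convention and the COMPATIBLE/INCOMPATIBLE outcome come from the design above:
+```python
+# Sketch: execute LLM-generated test code in a venv and check the result.
+import re
+import subprocess
+
+def run_test(venv_python: str, code: str, expected_regex: str) -> bool:
+    proc = subprocess.run([venv_python, "-c", code],
+                          capture_output=True, text=True, timeout=600)
+    if proc.returncode != 0:
+        return False  # in the real system, stderr would go to the LLM for repair
+    return re.search(expected_regex, proc.stdout.strip()) is not None
+
+ok = run_test(
+    "/root/venvs/env0/bin/python",                        # hypothetical venv
+    "from google import protobuf; print(protobuf.__version__)",
+    r"^\d+\.\d+(\.\d+)?.*$",                              # version-number pattern
+)
+print("COMPATIBLE" if ok else "INCOMPATIBLE")
+```
+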
+## Running the Tool
+1. Environment preparation
+- Install the GPU drivers and the jq tool
+- Create a virtual environment to run the system itself in
+- Edit the mcp_servers_config.json file
+- Create the results and tmp folders, and put 软件列表.xlsx into the tmp folder
+- Create a venvs directory to hold the virtual environments used during installation and verification
+- Edit the config.json file
+2. Run
+- To run the complete pipeline, from collecting PyPI names on GitHub through verification, run the run.sh script:
+    ```shell
+    sh ./run.sh --venv=/root/.main_venv --use-llm --topic=ai --fetch-github-repos --generate-package-info --verify-packages
+    ```
+- To test the package list provided by the task committee, run run_for_软件列表.sh:
+    ```shell
+    sh ./run_for_软件列表.sh --venv=/root/.main_venv --use-llm --generate-package-info --verify-packages
+    ```
+3. Output
+- The results folder holds each package's detailed test results and the final report
+- Final report
+    The report aggregates the following fields (an illustrative sample report appears at the end of this section):
+    + `total_packages`: total number of packages to verify;
+    + `not_found_packages`: packages that could not be found on PyPI, e.g. because of bad names in the input list;
+    + `total_exists_packages`: packages found on PyPI;
+    + `create_env_failed_packages`: packages that failed because a virtual environment could not be created;
+    + `install_failed_packages`: packages that failed to install;
+    + `env_resolve_failed_packages`: packages that failed because the system environment could not be resolved;
+    + `verify_failed_packages`: packages that failed verification, i.e. status `INCOMPATIBLE`;
+    + `successful_packages`: packages verified successfully, i.e. status `COMPATIBLE`;
+    + `install_rate`: install success rate, i.e. successfully installed packages / packages found on PyPI;
+    + `compatibility_rate`: verification success rate, i.e. successfully verified packages / packages found on PyPI;
+    + `install_rate_total`: overall install success rate, i.e. successfully installed packages / total packages to verify;
+    + `compatibility_rate_total`: overall verification success rate, i.e. successfully verified packages / total packages to verify;
+    + `details`: per-package installation and verification details;
+    + `timestamp`: the current timestamp
+
+4. Recommended setup
+- Environment preparation
+    + Start from the official `opencloudos/opencloudos9-minimal:latest` Docker image and create a container from it, remembering to pass `--gpus all`; recommended command:
+    ```shell
+    docker run -it --name opencloudos9 --gpus all -p 8000:8000 opencloudos/opencloudos9-minimal:latest bash
+    ```
+    + Create the python symlink and install pip:
+    ```shell
+    ln -s /usr/bin/python3 /usr/bin/python
+    dnf install -y python3-pip
+    python -m pip install -U pip  # upgrade pip to the latest version
+    ```
+    + Install jq, used for parsing JSON files:
+    ```shell
+    dnf install -y jq
+    ```
+    + Create the project's virtual environment under the code directory:
+    ```shell
+    cd code
+    python -m venv .main_venv
+    ```
+    + Under the code directory, create the tmp, results, and venvs directories
+    + Edit code/mcp_chat_bot/mcp_servers/mcp_servers_config.json (absolute paths recommended):
+    ```json
+    {
+        "mcpServers": {
+            "github_analyst": {
+                "command": "/root/oc_contributor_huangzhenye/code/.main_venv/bin/python",
+                "args": [
+                    "-u",
+                    "/root/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/github_analyst.py"
+                ]
+            },
+            "pypi_analyst": {
+                "command": "/root/oc_contributor_huangzhenye/code/.main_venv/bin/python",
+                "args": [
+                    "-u",
+                    "/root/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/pypi_analyst.py"
+                ]
+            },
+            "dependency_analyst": {
+                "command": "/root/oc_contributor_huangzhenye/code/.main_venv/bin/python",
+                "args": [
+                    "-u",
+                    "/root/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/dependency_analyst.py"
+                ]
+            },
+            "test_executor": {
+                "command": "/root/oc_contributor_huangzhenye/code/.main_venv/bin/python",
+                "args": [
+                    "-u",
+                    "/root/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/test_executor.py"
+                ]
+            }
+        }
+    }
+    ```
+    + Edit the config.json file under the code directory (absolute paths recommended):
+    ```json
+    {
+        "github_access_token": "",
+        "llm_model_name": "deepseek-chat",
+        "llm_access_token": "",
+        "llm_base_url": "https://api.deepseek.com/chat/completions",
+
+        "//": "How and where data is stored; saving to a JSON file or to a database are both supported",
+        "save_method": "db",
+        "json_file_path": "/root/oc_contributor_huangzhenye/code/package_manager/package_info.json",
+        "db_path": "/root/oc_contributor_huangzhenye/code/package_info.db",
+        "result_path": "/root/oc_contributor_huangzhenye/code/results",
+
+        "//": "Location of the tmp directory, which holds 软件列表.xlsx and temporary files",
+        "tmp_path": "/root/oc_contributor_huangzhenye/code/tmp",
+
+        "//": "Path holding the virtual environments used for installation and verification; do not put the project's own runtime virtual environment here, to avoid breaking the project's runtime environment",
+        "venvs_path": "/root/oc_contributor_huangzhenye/code/venvs"
+    }
+    ```
+    + Check that MCP works:
+    ```shell
+    source code/.main_venv/bin/activate
+    pip install -r code/requirements.txt
+    python code/mcp_chat_bot/unit_tests/single_prompt.py --url https://github.com/numpy/numpy
+    ```
+- Run
+    + Run the shell script from the code directory:
+    ```shell
+    sh ./run_for_软件列表.sh --venv=/root/oc_contributor_huangzhenye/code/.main_venv --use-llm --generate-package-info --verify-packages
+    ```
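+
+As promised in step 3 above, a final report might look like the sample below. All counts, rates, the `details` shape, and the timestamp are invented for illustration, not measured results:
+```json
+{
+    "total_packages": 120,
+    "not_found_packages": 5,
+    "total_exists_packages": 115,
+    "create_env_failed_packages": 0,
+    "install_failed_packages": 8,
+    "env_resolve_failed_packages": 2,
+    "verify_failed_packages": 10,
+    "successful_packages": 95,
+    "install_rate": 0.930,
+    "compatibility_rate": 0.826,
+    "install_rate_total": 0.892,
+    "compatibility_rate_total": 0.792,
+    "details": {"numpy": "COMPATIBLE"},
+    "timestamp": "2025-08-01 12:00:00"
+}
+```
+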
+### Notes
+1. Because MCP installs software with dnf, running as root is recommended (to be optimized later)
+2. Generating the information and installing and testing the packages is a complex process. In local testing, generating the `Package` records from `软件列表.xlsx` can take half a day to a full day; the installation and verification phase was never run end to end locally for lack of disk space, and a complete run is estimated to take one to two days. For a sample run, trim `软件列表.xlsx` down to fewer rows first.
+3. To speed things up, the important folder also ships a `package_info.db` database containing the `Package` records the project generated from `软件列表.xlsx`. Copy it into the code directory and then run
+```shell
+sh ./run_for_软件列表.sh --venv=/root/oc_contributor_huangzhenye/code/.main_venv --use-llm --verify-packages
+```
+to start verification directly.
+
+## Conclusion and Outlook
+### Conclusion
+1. Early on, I focused on collecting PyPI packages and their related information as thoroughly as possible, and settled on first checking whether a package exists, then pulling the details from PyPI. For installation, my first idea was to install packages in dependency order so that version conflicts surface ahead of time, and then to avoid conflicts by creating new virtual environments. For verification, I realized that most AI packages need a GPU for full functionality, so I designed three kinds of tests: import, basic functionality, and GPU usage.
+2. I found that generating package information is a complex and variable process that also needs an LLM to generate verification code. To gather and generate this information more intelligently, I studied the MCP protocol and built my own MCP bot on top of the source of an [open-source project](https://github.com/keli-wen/mcp_chatbot); thanks to `keliwen@stu.pku.edu.cn` for this contribution to the open-source community. The bot was subsequently reused in package installation, package verification, and several other stages.
+3. Innovations:
+- Combine GitHub and PyPI information to obtain more accurate package details.
+- Use MCP tooling for smarter package analysis, installation, and verification.
+- Analyze the PyPI install process and the files it produces to anticipate problems such as missing environment dependencies, version conflicts, and install-name/import-name mismatches.
+- Provide system package-management tools to fix install/verification failures caused by missing system dependencies.
+
+## References
+[1] Peng Y, Hu R, Wang R, et al. Less is more? An empirical study on configuration issues in the Python PyPI ecosystem[C]//Proceedings of the IEEE/ACM 46th International Conference on Software Engineering. 2024: 1-12.
+
+[2] MCP Specification: https://modelcontextprotocol.io/specification/2025-06-18
+
+[3] mcp_chatbot: https://github.com/keli-wen/mcp_chatbot
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/assets/AI\350\275\257\344\273\266\350\207\252\345\212\250\351\252\214\350\257\201\345\267\245\345\205\267.png" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/assets/AI\350\275\257\344\273\266\350\207\252\345\212\250\351\252\214\350\257\201\345\267\245\345\205\267.png"
new file mode 100644
index 0000000000000000000000000000000000000000..03954efb83513417682f3e9af245d88c175bf17e
Binary files /dev/null and "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/assets/AI\350\275\257\344\273\266\350\207\252\345\212\250\351\252\214\350\257\201\345\267\245\345\205\267.png" differ
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/ai_agent/llm_api.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/ai_agent/llm_api.py"
new file mode 100644
index 0000000000000000000000000000000000000000..95435f4256dc8b5e47b600a707025b463a9dca94
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/ai_agent/llm_api.py"
@@ -0,0 +1,134 @@
+# -*- coding: utf-8 -*-
+
+import uuid
+import json
+import requests
+import os
+import sys
+
+CONFIG_FILE = os.path.join(os.path.dirname(__file__), '../config.json')
+
+def call_llm_api(prompt,
+                 temperature=1.0,
+                 top_p=1.0,
+                 max_tokens=8192,
+                 api_key=None,
+                 verbose=False):
+    """
+    Call the LLM API.
+
+    Args:
+        prompt (list): chat messages to send, e.g. [{"role": "user", "content": "..."}]
+        temperature (float): sampling temperature controlling output randomness, default 1.0
+        top_p (float): nucleus-sampling parameter, default 1.0
+        max_tokens (int): maximum number of output tokens, default 8192
+        api_key (str): API key; when None, it is read from config.json (llm_access_token)
+        verbose (bool): whether to print request/response details, default False
+
+    Returns:
+        str: response content returned by the API
+    """
+
+    # Read the model name, API key, and endpoint from the config file
+    with open(CONFIG_FILE, 'r') as f:
+        config = json.load(f)
+    model = config['llm_model_name']
+    if api_key is None:
+        api_key = config['llm_access_token']
+    ss_url = config['llm_base_url']
+
+    headers = {
+        "Content-Type": "application/json",
+        "Authorization": f"Bearer {api_key}",
+    }
+
+    # Build the request payload
+    json_data = {
+        "query_id": "query_id_" + str(uuid.uuid4()),
+        "model": model,
+        # "messages": [
+        #     {"role": "user", "content": prompt}
+        # ],
+        "messages": prompt,
+        "temperature": temperature,
+        "top_p": top_p,
+        "max_tokens": max_tokens,
+        "stream": False
+    }
+
+    if verbose:
+        print('Input:\n{} | {} | {}'.format(ss_url, headers, json_data))
+
+    try:
+        resp = requests.post(ss_url, headers=headers, json=json_data)
+
+        if verbose:
+            print(f'Output: {resp}')
+
+        # Non-streaming response handling
+        if resp.status_code == 200:
+            response_data = resp.json()
+            if 'choices' in response_data and len(response_data['choices']) > 0:
+                content = response_data['choices'][0]['message']['content']
+                return content
+            else:
+                return resp.text
+        else:
+            if verbose:
+                print(f"Request failed with status code: {resp.status_code}")
+            return resp.text
+
+    except Exception as e:
+        if verbose:
+            print(f"Error while calling the API: {e}")
+        return f"Error: {str(e)}"
+
+
+def read_file_content(file_path):
+    """
+    Read a file's content.
+
+    Args:
+        file_path (str): file path
+
+    Returns:
+        str: file content, or None if reading fails
+    """
+    try:
+        with open(file_path, 'r', encoding='utf-8') as f:
+            return f.read().strip()
+    except Exception as e:
+        print(f"Failed to read file {file_path}: {e}")
+        return None
+
+
+def main():
+    """
+    Entry point for testing the API call.
+    An input file may be given as a command-line argument.
+    """
+    # Check command-line arguments
+    if len(sys.argv) > 1:
+        file_path = sys.argv[1]
+        if os.path.exists(file_path):
+            print(f"=== Reading input from file: {file_path} ===")
+            test_prompt = read_file_content(file_path)
+            if test_prompt is None:
+                return
+        else:
+            print(f"File does not exist: {file_path}")
+            return
+    else:
+        # Default test case
+        test_prompt = "Who are you"
+        print("=== Using the default test input ===")
+
+    print(f"Input: {test_prompt[:100]}{'...' 
if len(test_prompt) > 100 else ''}")
+    result = call_llm_api([{"role": "user", "content": test_prompt}], verbose=True)
+    print(f"\nResult: {result}")
+
+
+if __name__ == "__main__":
+    main()
\ No newline at end of file
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/config.json" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/config.json"
new file mode 100644
index 0000000000000000000000000000000000000000..faff9d75ba5f3c419111385139afe22272b2955b
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/config.json"
@@ -0,0 +1,18 @@
+{
+    "github_access_token": "",
+    "llm_model_name": "deepseek-chat",
+    "llm_access_token": "",
+    "llm_base_url": "https://api.deepseek.com/chat/completions",
+
+    "//": "How and where data is stored; saving to a JSON file or to a database are both supported",
+    "save_method": "db",
+    "json_file_path": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/package_manager/package_info.json",
+    "db_path": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/package_info.db",
+    "result_path": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/results",
+
+    "//": "Location of the tmp directory, which holds 软件列表.xlsx and temporary files",
+    "tmp_path": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/tmp",
+
+    "//": "Path holding the virtual environments used for installation and verification; do not put the project's own runtime virtual environment here, to avoid breaking the project's runtime environment",
+    "venvs_path": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/venvs"
+}
\ No newline at end of file
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_converter.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_converter.py"
new file mode 100644
index 0000000000000000000000000000000000000000..549afeae703d2c37c5cc59f1d5e3057b16c46af1
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 
AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_converter.py" @@ -0,0 +1,145 @@ +import sys +import json +from pathlib import Path +from typing import Dict, Any, Optional + +root_dir = Path(__file__).parent.parent +sys.path.insert(0, str(root_dir)) +from data_manager.package_info import Package, PackageInfo + + +class PackageConverter: + """包信息格式转换器""" + + @staticmethod + def dict_to_model(data: Dict[str, Any]) -> Package: + """将字典转换为Package模型""" + # 处理info字段为字符串错误信息的情况 + if isinstance(data.get("info"), str): + # 如果info是字符串错误信息,创建一个全是空字符串的PackageInfo对象 + data_copy = data.copy() + data_copy["info"] = PackageInfo() # 使用默认值创建空的PackageInfo + return Package.model_validate(data_copy) + return Package.model_validate(data) + + @staticmethod + def model_to_dict(package: Package) -> Dict[str, Any]: + """将Package模型转换为字典""" + return package.model_dump() + + @staticmethod + def db_row_to_model(row: tuple) -> Package: + """将数据库行转换为Package模型""" + # 检查是否存在包信息 + if len(row) >= 10 and row[9]: # exists为True + package_info = PackageInfo( + dependency=json.loads(row[1]) if row[1] else [], + import_test_code=row[2] or "", + import_test_expected_result=row[3] or "", + function_test_code=row[4] or "", + function_test_expected_result=row[5] or "", + gpu_test_code=row[6] or "", + gpu_test_expected_result=row[7] or "", + verified=row[8] or "False" + ) + else: + # 创建空的PackageInfo对象而不是None + package_info = PackageInfo() + + return Package( + package_name=row[0], + info=package_info, + exists=row[9] if len(row) >= 10 else False + ) + + @staticmethod + def json_item_to_model(package_name: str, item_info: Dict[str, Any]) -> Package: + """将JSON项转换为Package模型""" + if item_info['exists'] == "True": + package_info = PackageInfo.model_validate(item_info) + else: + # 创建空的PackageInfo对象 + package_info = PackageInfo() + + return Package( + package_name = package_name, + info = package_info, + exists = True if item_info['exists'] == "True" else False + ) + +if __name__ == "__main__": + # 测试代码 + print("------ dict-to-model 1-------------") + sample_data = { + "package_name": "example-package", + "info": { + "dependency": ["numpy"], + "import_test_code": "import example", + "import_test_expected_result": "", + "function_test_code": "print(example.func())", + "function_test_expected_result": "42", + "gpu_test_code": "", + "gpu_test_expected_result": "", + "verified": "False" + }, + "exists": True + } + package1 = PackageConverter.dict_to_model(sample_data) + print(package1) + + print("------ dict-to-model 2 (not exists)-------------") + sample_data = { + "package_name": "numpy", + "info": "Not package found in PyPI", + "exists": False + } + package2 = PackageConverter.dict_to_model(sample_data) + print(package2) + + print("------ model-to-dict -------------") + package_dict = PackageConverter.model_to_dict(package2) + print(json.dumps(package_dict, indent=4)) + + print("------ db-row-to-model -------------") + # 测试存在的包 + if package1.exists and package1.info and isinstance(package1.info, PackageInfo): + db_row = ( + package1.package_name, + json.dumps(package1.info.dependency), + package1.info.import_test_code, + package1.info.import_test_expected_result, + package1.info.function_test_code, + package1.info.function_test_expected_result, + package1.info.gpu_test_code, + package1.info.gpu_test_expected_result, + package1.info.verified, + package1.exists, + ) + 
package_from_db = PackageConverter.db_row_to_model(db_row) + print("Existing package from DB:", package_from_db) + else: + print("Skipping db_row_to_model test for non-existent package") + + # 测试不存在的包的数据库行 + db_row_non_existent = ("non-existent-package", "[]", "", "", "", "", "", "", "False", False) + package_from_db_non_existent = PackageConverter.db_row_to_model(db_row_non_existent) + print("Non-existent package from DB:", package_from_db_non_existent) + + print("------ json-item-to-model -------------") + sample_data = { + "protobuf": { + "dependency": [], + "import_test_code": "from google import protobuf", + "import_test_expected_result": "", + "function_test_code": "import sys; from google import protobuf; result = protobuf.__version__; print(result)", + "function_test_expected_result": "^\\d+\\.\\d+(\\.\\d+)?.*$", + "gpu_test_code": "", + "gpu_test_expected_result": "", + "exists": "True" + } + } + name_and_info_to_model = PackageConverter.json_item_to_model( + "protobuf", + sample_data["protobuf"] + ) + print(name_and_info_to_model) diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_info.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_info.py" new file mode 100644 index 0000000000000000000000000000000000000000..ba1175b26fc8f0baa96a07826dd68f0c719fd102 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_info.py" @@ -0,0 +1,19 @@ +from typing import List, Optional, Union +from pydantic import BaseModel, Field + +class PackageInfo(BaseModel): + """包信息的数据模型""" + dependency: List[str] = Field(default_factory=list) + import_test_code: str = "" + import_test_expected_result: str = "" + function_test_code: str = "" + function_test_expected_result: str = "" + gpu_test_code: str = "" + gpu_test_expected_result: str = "" + verified: str = "False" + +class Package(BaseModel): + """完整的包记录模型""" + package_name: str + info: PackageInfo + exists: bool \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_repository.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 
AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_repository.py" new file mode 100644 index 0000000000000000000000000000000000000000..759089fec2af20a241734582bc178b63fac479d2 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/data_manager/package_repository.py" @@ -0,0 +1,153 @@ +import sqlite3 +import sys +import json +from typing import List, Optional +from pathlib import Path + +root_dir = Path(__file__).parent.parent +sys.path.insert(0, str(root_dir)) + +from data_manager.package_info import Package, PackageInfo +from data_manager.package_converter import PackageConverter + +class PackageRepository: + """包信息仓库""" + + def __init__(self, db_path: str, json_path: str): + self.db_path = db_path + self.json_path = json_path + + def get_certain_package_list_from_db(self, condition: Optional[str] = None) -> List[Package]: + with sqlite3.connect(self.db_path) as conn: + cursor = conn.cursor() + query = f"SELECT * FROM packages" + if condition: + query += f" WHERE {condition}" + cursor.execute(query) + rows = cursor.fetchall() + return [PackageConverter.db_row_to_model(row) for row in rows] + + def modify_package_in_db(self, package_name: str, column: str, new_value) -> None: + """修改数据库中某个包的指定字段""" + with sqlite3.connect(self.db_path) as conn: + cursor = conn.cursor() + query = f"UPDATE packages SET {column} = ? WHERE package_name = ?" + cursor.execute(query, (new_value, package_name)) + conn.commit() + + def save_to_db(self, package: Package) -> None: + """保存包信息到数据库""" + with sqlite3.connect(self.db_path) as conn: + cursor = conn.cursor() + self._ensure_table_exists(cursor) + + info = package.info + cursor.execute(""" + INSERT INTO packages + VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?) 
+ ON CONFLICT(package_name) DO UPDATE SET + package_name=excluded.package_name, + dependency=excluded.dependency, + import_test_code=excluded.import_test_code, + import_test_expected_result=excluded.import_test_expected_result, + function_test_code=excluded.function_test_code, + function_test_expected_result=excluded.function_test_expected_result, + gpu_test_code=excluded.gpu_test_code, + gpu_test_expected_result=excluded.gpu_test_expected_result, + verified=excluded.verified, + `exists`=excluded.`exists` + """, ( + package.package_name, + json.dumps(info.dependency), + info.import_test_code, + info.import_test_expected_result, + info.function_test_code, + info.function_test_expected_result, + info.gpu_test_code, + info.gpu_test_expected_result, + info.verified, + package.exists + )) + + def get_certain_package_list_from_json(self, attr: Optional[str] = None, value: Optional[str] = None) -> List[Package]: + packages = [] + with open(self.json_path, 'r', encoding='utf-8') as f: + data = json.load(f) + for pkg_name, item_info in data.items(): + if (attr is None or item_info.get(attr) == value): + package = PackageConverter.json_item_to_model(pkg_name, item_info) + packages.append(package) + return packages + + def modify_package_in_json(self, package_name: str, key: str, new_value) -> None: + """修改JSON文件中某个包的指定字段""" + try: + with open(self.json_path, 'r', encoding='utf-8') as f: + data = json.load(f) + except (FileNotFoundError, json.JSONDecodeError): + data = {} + + if package_name in data: + data[package_name][key] = new_value + with open(self.json_path, 'w', encoding='utf-8') as f: + json.dump(data, f, ensure_ascii=False, indent=4) + + def save_to_json(self, package: Package) -> None: + """保存包信息到JSON文件""" + try: + with open(self.json_path, 'r', encoding='utf-8') as f: + data = json.load(f) + except (FileNotFoundError, json.JSONDecodeError): + data = {} + + package_dict = PackageConverter.model_to_dict(package) + # 将exists字段添加到info中 + item_info = package_dict['info'] + item_info['exists'] = package_dict['exists'] + data[package.package_name] = item_info + + with open(self.json_path, 'w', encoding='utf-8') as f: + json.dump(data, f, ensure_ascii=False, indent=4) + + def _ensure_table_exists(self, cursor: sqlite3.Cursor) -> None: + """确保数据表存在""" + cursor.execute(""" + CREATE TABLE IF NOT EXISTS packages ( + package_name TEXT PRIMARY KEY, + dependency TEXT, + import_test_code TEXT, + import_test_expected_result TEXT, + function_test_code TEXT, + function_test_expected_result TEXT, + gpu_test_code TEXT, + gpu_test_expected_result TEXT, + verified TEXT, + `exists` BOOLEAN + ) + """) + +if __name__ == "__main__": + package_respository = PackageRepository("package_info_test.db", "package_info_test.json") + package1 = Package( + package_name="numpy", + info=PackageInfo( + dependency=["setuptools"], + import_test_code="import numpy as np", + import_test_expected_result="", + function_test_code="print(np.__version__)", + function_test_expected_result=r"\d+\.\d+\.\d+", + gpu_test_code="", + gpu_test_expected_result="", + verified="True" + ), + exists=True + ) + package_respository.save_to_db(package1) + package_respository.save_to_json(package1) + package2 = Package( + package_name = "abc", + info = PackageInfo(), + exists = False + ) + package_respository.save_to_db(package2) + package_respository.save_to_json(package2) diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 
AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/analyse_dependency.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/analyse_dependency.py" new file mode 100644 index 0000000000000000000000000000000000000000..74091c60755baaab12854b8ad6ccea38b604b7f9 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/analyse_dependency.py" @@ -0,0 +1,788 @@ +#!/usr/bin/env python3 +"""Resolve runtime dependency package names for a PyPI package using multiple sources. + +This tool combines: +- PyPI metadata (requires_dist) +- pip download -> METADATA/PKG-INFO parsing as a fallback +- repository inspection (pyproject.toml / setup.cfg / requirements.txt) + +It filters out optional extras, test/dev dependencies and any requires_dist entries +whose environment markers do not evaluate true for the current environment. + +Usage: python analyse_pypi2.py [--version X.Y] [--no-repo] +""" +from __future__ import annotations + +import argparse +import json +import os +import re +import shutil +import subprocess +import sys +import tempfile +from typing import Dict, List, Optional, Set + +import requests + +# Optional helpers from packaging +try: + from packaging.requirements import Requirement # type: ignore + from packaging.markers import default_environment # type: ignore +except Exception: + Requirement = None # type: ignore + default_environment = None # type: ignore + +try: + import tomllib as toml # Python 3.11+ +except Exception: + try: + import toml # type: ignore + except Exception: + toml = None # type: ignore + +PYPI_JSON = "https://pypi.org/pypi/{pkg}/json" +NAME_RE = re.compile(r"^\s*([A-Za-z0-9_\-\.]+)") + + +def fetch_pypi_json(package: str, version: Optional[str] = None, timeout: int = 10) -> Optional[Dict]: + try: + url = PYPI_JSON.format(pkg=package) if version is None else f"https://pypi.org/pypi/{package}/{version}/json" + r = requests.get(url, timeout=timeout) + r.raise_for_status() + return r.json() + except Exception: + return None + + +def _parse_requires_dist(entry: str): + """Return tuple (name, version_spec, extras_set, marker) or None on parse failure. + + Uses packaging.Requirement when available for robust parsing. + Returns version_spec as "Any" if no version constraints are found. 
+ """ + if not entry: + return None + if Requirement: + try: + req = Requirement(entry) + name = req.name + extras = set(req.extras) if req.extras else set() + marker = req.marker # may be None + # Extract version specification + version_spec = str(req.specifier) if req.specifier else "Any" + return name, version_spec, extras, marker + except Exception: + pass + + # fallback simple parse + parts = entry.split(";", 1) + req_part = parts[0].strip() + marker = parts[1].strip() if len(parts) > 1 else None + + # Extract extras + extras = set() + if "[" in req_part and "]" in req_part: + before_bracket = req_part.split("[", 1)[0] + bracket_content = req_part.split("[", 1)[1].split("]", 1)[0] + req_part = before_bracket + (req_part.split("]", 1)[1] if "]" in req_part else "") + extras = set(e.strip() for e in bracket_content.split(",") if e.strip()) + + # Extract name and version specification + name_part = req_part + version_spec = "Any" + + # Look for version specifiers + version_pattern = r"([A-Za-z0-9_\-\.]+)\s*([><=!~,\s\d\.]+)" + match = re.match(version_pattern, req_part) + if match: + name_part = match.group(1) + version_spec = match.group(2).strip() + else: + # Simple name extraction + m = NAME_RE.match(req_part) + if m: + name_part = m.group(1) + + if not name_part: + return None + + return name_part, version_spec, extras, marker + + +def requires_dist_filtered(package: str, version: Optional[str] = None, include_extras: bool = False) -> List[str]: + """Get dependency info from PyPI requires_dist while filtering extras and markers. + + Returns list of strings in format 'package_name version_spec' or 'package_name Any' + """ + meta = fetch_pypi_json(package, version) + if not meta: + return [] + requires = meta.get("info", {}).get("requires_dist") or [] + env = default_environment() if default_environment else None + seen: Set[str] = set() + result: List[str] = [] + + for entry in requires: + parsed = _parse_requires_dist(entry) + if not parsed: + continue + name, version_spec, extras, marker = parsed + + # skip optional extras unless requested + if extras and not include_extras: + continue + # skip markers that reference 'extra' unless including extras + marker_str = str(marker) if marker is not None else "" + if "extra" in marker_str and not include_extras: + continue + # evaluate marker in current environment + if marker and env is not None: + try: + if not marker.evaluate(env): + continue + except Exception: + # conservative: skip if marker cannot be evaluated + continue + + if name and name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + result.append(dependency_spec) + return result + + +def _extract_metadata_requires_from_wheel(wheel_path: str, include_extras: bool = False) -> List[str]: + import zipfile + + deps: List[str] = [] + seen: Set[str] = set() + with zipfile.ZipFile(wheel_path) as z: + for f in z.namelist(): + if f.endswith("/METADATA") or f.endswith(".dist-info/METADATA"): + with z.open(f) as fh: + content = fh.read().decode(errors="ignore") + for line in content.splitlines(): + if line.startswith("Requires-Dist:"): + entry = line.split(":", 1)[1].strip() + parsed = _parse_requires_dist(entry) + if not parsed: + continue + name, version_spec, extras, marker = parsed + + # skip optional extras unless requested + if extras and not include_extras: + continue + marker_str = str(marker) if marker is not None else "" + if "extra" in marker_str and not include_extras: + continue + env = default_environment() if default_environment else None + if 
marker and env is not None: + try: + if not marker.evaluate(env): + continue + except Exception: + continue + if name and name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + deps.append(dependency_spec) + return deps + + +def _extract_metadata_requires_from_sdist(sdist_path: str, include_extras: bool = False) -> List[str]: + deps: List[str] = [] + seen: Set[str] = set() + + def process_line(line: str): + if line.startswith("Requires-Dist:") or line.startswith("Requires:"): + entry = line.split(":", 1)[1].strip() + parsed = _parse_requires_dist(entry) + if not parsed: + return + name, version_spec, extras, marker = parsed + if extras and not include_extras: + return + marker_str = str(marker) if marker is not None else "" + if "extra" in marker_str and not include_extras: + return + env = default_environment() if default_environment else None + if marker and env is not None: + try: + if not marker.evaluate(env): + return + except Exception: + return + if name and name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + deps.append(dependency_spec) + + if sdist_path.endswith((".zip", ".whl")): + import zipfile + + with zipfile.ZipFile(sdist_path) as z: + for n in z.namelist(): + if n.endswith("PKG-INFO") or n.endswith("METADATA"): + with z.open(n) as fh: + content = fh.read().decode(errors="ignore") + for line in content.splitlines(): + process_line(line) + else: + import tarfile + + with tarfile.open(sdist_path, "r:*") as tar: + members = [m for m in tar.getmembers() if m.name.endswith("PKG-INFO") or m.name.endswith("METADATA")] + for m in members: + f = tar.extractfile(m) + if not f: + continue + content = f.read().decode(errors="ignore") + for line in content.splitlines(): + process_line(line) + return deps + + +def pip_download_metadata(package: str, version: Optional[str] = None, include_extras: bool = False) -> List[str]: + """Use pip download --no-deps and parse artifacts for Requires-Dist as fallback.""" + tmpdir = tempfile.mkdtemp(prefix="pypi_dl_") + try: + pkg_spec = f"{package}=={version}" if version else package + cmd = [sys.executable, "-m", "pip", "download", "--no-deps", "--dest", tmpdir, pkg_spec] + proc = subprocess.run(cmd, capture_output=True, text=True, timeout=120) + if proc.returncode != 0: + return [] + files = os.listdir(tmpdir) + deps: List[str] = [] + for fname in files: + path = os.path.join(tmpdir, fname) + if fname.endswith(".whl"): + deps.extend(_extract_metadata_requires_from_wheel(path, include_extras=include_extras)) + elif fname.endswith((".tar.gz", ".zip", ".tar.bz2", ".tar")): + deps.extend(_extract_metadata_requires_from_sdist(path, include_extras=include_extras)) + + # dedupe preserving order + seen: Set[str] = set() + out: List[str] = [] + for dep_spec in deps: + # Extract package name for deduplication (before first space) + dep_name = dep_spec.split()[0] + if dep_name not in seen: + seen.add(dep_name) + out.append(dep_spec) + return out + except Exception: + return [] + finally: + shutil.rmtree(tmpdir, ignore_errors=True) + + +def _fetch_raw(url: str, timeout: int = 10) -> Optional[str]: + try: + r = requests.get(url, timeout=timeout) + if r.status_code == 200: + return r.text + except Exception: + pass + return None + + +def repo_inspect_from_homepage(home_url: str, include_extras: bool = False) -> List[str]: + """Best-effort: if homepage is a GitHub repo, fetch common files to extract deps.""" + if not home_url: + return [] + m = re.search(r"github\.com[:/]+([^/]+)/([^/]+)(?:/|$)", home_url) + 
if not m: + return [] + owner, repo = m.group(1), m.group(2).rstrip(".git") + candidates = [ + f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/pyproject.toml", + f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/setup.cfg", + f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/setup.py", + f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/requirements.txt", + ] + results: List[str] = [] + seen: Set[str] = set() + env = default_environment() if default_environment else None + + for url in candidates: + txt = _fetch_raw(url) + if not txt: + continue + if url.endswith("pyproject.toml") and toml: + try: + data = toml.loads(txt) + deps: List[str] = [] + proj = data.get("project") if isinstance(data, dict) else None + if not proj: + # poetry support + tool = data.get("tool", {}) + poetry = tool.get("poetry") if isinstance(tool, dict) else None + proj = poetry + if proj: + if isinstance(proj.get("dependencies"), dict): + for k, v in proj["dependencies"].items(): + if k not in seen: + seen.add(k) + version_spec = str(v) if v != "python" and v else "Any" + dependency_spec = f"{k} {version_spec if version_spec != k else 'Any'}" + results.append(dependency_spec) + elif isinstance(proj.get("dependencies"), list): + for entry in proj["dependencies"]: + parsed = _parse_requires_dist(entry) if isinstance(entry, str) else None + if not parsed: + continue + name, version_spec, extras, marker = parsed + if extras and not include_extras: + continue + marker_str = str(marker) if marker is not None else "" + if "extra" in marker_str and not include_extras: + continue + if marker and env is not None: + try: + if not marker.evaluate(env): + continue + except Exception: + continue + if name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + results.append(dependency_spec) + continue + except Exception: + pass + if url.endswith("setup.cfg"): + try: + import configparser + + cfg = configparser.ConfigParser() + cfg.read_string(txt) + if cfg.has_section("options") and cfg.has_option("options", "install_requires"): + raw = cfg.get("options", "install_requires") + for line in raw.splitlines(): + line = line.strip() + if not line: + continue + # attempt parse using packaging if available + parsed = _parse_requires_dist(line) + if not parsed: + continue + name, version_spec, extras, marker = parsed + if extras and not include_extras: + continue + marker_str = str(marker) if marker is not None else "" + if "extra" in marker_str and not include_extras: + continue + if marker and env is not None: + try: + if not marker.evaluate(env): + continue + except Exception: + continue + if name and name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + results.append(dependency_spec) + continue + except Exception: + pass + if url.endswith("requirements.txt"): + lines = txt.splitlines() + for line in lines: + line = line.strip() + if not line or line.startswith("#"): + continue + # Parse using _parse_requires_dist for version info + parsed = _parse_requires_dist(line) + if parsed: + name, version_spec, extras, marker = parsed + if name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + results.append(dependency_spec) + else: + # fallback: strip extras and specifiers for name only + if "[" in line: + line = line.split("[", 1)[0] + original_line = line + for sep in ("==", ">=", "<=", "~=", ">", "<", "!="): + if sep in line: + line = line.split(sep, 1)[0].strip() + break + mname = NAME_RE.match(line) + if mname: + n = mname.group(1) + 
if n not in seen: + seen.add(n) + # Try to extract version from original line + version_spec = "Any" + for sep in ("==", ">=", "<=", "~=", ">", "<", "!="): + if sep in original_line: + version_spec = original_line.split(sep, 1)[1].strip() + version_spec = sep + version_spec + break + dependency_spec = f"{n} {version_spec}" + results.append(dependency_spec) + continue + if url.endswith("setup.py"): + # naive parse: look for install_requires = [ ... ] + mlist = re.search(r"install_requires\s*=\s*\[([^\]]+)\]", txt, re.S) + if mlist: + block = mlist.group(1) + parts = re.findall(r"['\"]([^'\"]+)['\"]", block) + for p in parts: + parsed = _parse_requires_dist(p) + if not parsed: + continue + name, version_spec, extras, marker = parsed + if extras and not include_extras: + continue + marker_str = str(marker) if marker is not None else "" + if "extra" in marker_str and not include_extras: + continue + if marker and env is not None: + try: + if not marker.evaluate(env): + continue + except Exception: + continue + if name and name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + results.append(dependency_spec) + # attempt referenced requirements + mreq = re.search(r"requirements(?:.*?)\.txt", txt) + if mreq: + req_url = f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/requirements.txt" + txt2 = _fetch_raw(req_url) + if txt2: + for line in txt2.splitlines(): + line = line.strip() + if not line or line.startswith("#"): + continue + parsed = _parse_requires_dist(line) + if parsed: + name, version_spec, extras, marker = parsed + if name not in seen: + seen.add(name) + dependency_spec = f"{name} {version_spec}" + results.append(dependency_spec) + else: + # fallback + if "[" in line: + line = line.split("[", 1)[0] + original_line = line + for sep in ("==", ">=", "<=", "~=", ">", "<", "!="): + if sep in line: + line = line.split(sep, 1)[0].strip() + break + mname = NAME_RE.match(line) + if mname: + n = mname.group(1) + if n not in seen: + seen.add(n) + version_spec = "Any" + for sep in ("==", ">=", "<=", "~=", ">", "<", "!="): + if sep in original_line: + version_spec = original_line.split(sep, 1)[1].strip() + version_spec = sep + version_spec + break + dependency_spec = f"{n} {version_spec}" + results.append(dependency_spec) + return results + + +def find_dependency_for_pip_package(package: str, version: Optional[str] = None, include_extras: bool = False, include_repo: bool = True) -> List[str]: + final: List[str] = [] + seen: Set[str] = set() + + # 定义常见的非PyPI包(标准库和系统包) + non_pypi_packages = { + # Python 标准库模块 + 'sys', 'os', 'json', 're', 'math', 'time', 'datetime', 'collections', + 'itertools', 'functools', 'operator', 'copy', 'pickle', 'sqlite3', + 'threading', 'multiprocessing', 'subprocess', 'socket', 'urllib', + 'http', 'email', 'xml', 'html', 'hashlib', 'base64', 'uuid', + 'logging', 'warnings', 'traceback', 'inspect', 'types', 'typing', + 'pathlib', 'glob', 'shutil', 'tempfile', 'io', 'argparse', + 'configparser', 'csv', 'gzip', 'zipfile', 'tarfile', 'zlib', + 'unittest', 'doctest', 'pdb', 'profile', 'cProfile', 'timeit', + 'gc', 'weakref', 'ctypes', 'struct', 'array', 'bisect', 'heapq', + 'random', 'statistics', 'decimal', 'fractions', 'cmath', + + # 系统和特殊包 + 'python', 'python3', 'pip', 'setuptools', 'wheel', 'distutils', + 'pkg_resources', 'site', 'sysconfig', + + # 常见的虚拟包或元包 + 'win32', 'win32api', 'win32com', 'winerror', 'msvcrt', + 'posix', 'nt', 'pwd', 'grp', 'termios', 'tty', 'pty', + + # 其他常见的非安装包 + 'builtins', '__builtin__', '__future__', '__main__', + } 
+ + # 1. PyPI metadata (filtered) + pypi_list = requires_dist_filtered(package, version, include_extras=include_extras) + for dep in pypi_list: + # dep is now a string "name version_spec" + name = dep.split()[0].lower() + if name not in seen: + seen.add(name) + final.append(dep) + + # 2. pip download metadata (fallback) + pip_list = pip_download_metadata(package, version, include_extras=include_extras) + for dep in pip_list: + # dep is now a string "name version_spec" + name = dep.split()[0].lower() + if name not in seen: + seen.add(name) + final.append(dep) + + # 3. repository inspection (use project_urls/home_page from PyPI) + if include_repo: + meta = fetch_pypi_json(package, version) + home = None + if meta: + info = meta.get("info", {}) or {} + project_urls = info.get("project_urls") or {} + if isinstance(project_urls, dict): + home = project_urls.get("Source") or project_urls.get("Homepage") or info.get("home_page") + else: + home = info.get("home_page") + if home: + repolist = repo_inspect_from_homepage(home, include_extras=include_extras) + for dep in repolist: + # dep is now a string "name version_spec" + name = dep.split()[0].lower() + if name not in seen: + seen.add(name) + final.append(dep) + + # 过滤结果:删除自身和非PyPI包 + filtered_final = [] + package_name_lower = package.lower() + + for dep in final: + dep_name = dep.split()[0].lower() + + # 跳过自身 + if dep_name == package_name_lower: + continue + + # 跳过常见的非PyPI包 + if dep_name in non_pypi_packages: + continue + + # 跳过包名中包含常见非PyPI特征的包 + if (dep_name.startswith('__') and dep_name.endswith('__')) or \ + dep_name in ['python2', 'python3'] or \ + dep_name.startswith('python-') and dep_name.endswith('-dev'): + continue + + filtered_final.append(dep) + + return filtered_final + + +GPU_KEYWORDS = [ + "gpu", + "cuda", + "nvidia", + "cudnn", + "cublas", + "rocm", + "tensorrt", + "cupy", + "gpu-accelerated", + "cuda-toolkit", +] + +README_CANDIDATES = [ + 'README.md', 'readme.md', 'README.rst', + 'readme.rst', 'README.txt', 'README', + 'README.MD' +] + +def detect_gpu_requirement(package: str, version: Optional[str] = None, include_repo: bool = True) -> Dict: + """Return a dict with GPU detection: {'gpu': bool, 'matches': [keywords], 'sources': [which fields matched] }. + + The detector inspects PyPI metadata (summary, description, keywords, classifiers) and, + if requested, attempts to fetch the repository README for additional hints. 
+ """ + matches: Set[str] = set() + sources: Set[str] = set() + meta = fetch_pypi_json(package, version) + + def _scan_text(name: str, text: Optional[str]): + if not text: + return + t = text.lower() + for kw in GPU_KEYWORDS: + if kw in t: + matches.add(kw) + sources.add(name) + + if meta: + info = meta.get("info", {}) or {} + _scan_text("summary", info.get("summary")) + _scan_text("description", info.get("description")) + # keywords may be a space/comma separated string + kws = info.get("keywords") + if isinstance(kws, str): + _scan_text("keywords", kws) + # classifiers + classifiers = info.get("classifiers") or [] + if classifiers: + _scan_text("classifiers", "\n".join(classifiers)) + + # optional: fetch README from GitHub if homepage points there + if include_repo and meta: + info = meta.get("info", {}) or {} + project_urls = info.get("project_urls") or {} + home = None + if isinstance(project_urls, dict): + home = project_urls.get("Source") or project_urls.get("Homepage") or info.get("home_page") + else: + home = info.get("home_page") + if home and re.search(r"github\.com[:/]+([^/]+)/([^/]+)(?:/|$)", home): + # try common README locations + m = re.search(r"github\.com[:/]+([^/]+)/([^/]+)(?:/|$)", home) + owner, repo = m.group(1), m.group(2).rstrip(".git") + for readme_name in README_CANDIDATES: + url = f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/{readme_name}" + txt = _fetch_raw(url) + if txt: + _scan_text(f"readme:{readme_name}", txt) + # stop early if we already found matches + if matches: + break + + return {"gpu": bool(matches), "matches": sorted(matches), "sources": sorted(sources)} + + +def detect_gpu_level(package: str, version: Optional[str] = None, include_repo: bool = True) -> Dict: + """Classify GPU requirement into levels: 'required', 'optional', 'unknown', or 'none'. + + Heuristics: + - 'required' if requires_dist lists a GPU-related package (name matches GPU keywords), + or description/classifiers contain strong 'requires' phrasing near GPU keywords. + - 'optional' if requires_dist contains extras referencing GPU or README/description mentions + 'optional' near GPU keywords. + - 'unknown' if GPU keywords exist in metadata/README but no clear required/optional evidence. + - 'none' if no GPU keywords found. 
+ """ + meta = fetch_pypi_json(package, version) + found_keywords = set() + required_evidence = False + optional_evidence = False + + # helper for phrase checks + def _has_require_phrase(text: str) -> bool: + return bool(re.search(r"\b(require|requires|required|need|needs)\b.{0,40}\b(?:" + "|".join(GPU_KEYWORDS) + r")\b", text, flags=re.I)) + + def _has_optional_phrase(text: str) -> bool: + return bool(re.search(r"\b(optional|optionally|support for|supports)\b.{0,40}\b(?:" + "|".join(GPU_KEYWORDS) + r")\b", text, flags=re.I)) + + if meta: + info = meta.get("info", {}) or {} + # scan requires_dist entries + requires = info.get("requires_dist") or [] + for entry in requires: + parsed = _parse_requires_dist(entry) + if not parsed: + continue + name, version_spec, extras, marker = parsed + lname = (name or "").lower() + # direct dependency on a gpu-related package -> required + for kw in GPU_KEYWORDS: + if kw in lname: + required_evidence = True + found_keywords.add(kw) + # extras mentioning gpu -> optional + for ex in extras: + lex = ex.lower() + for kw in GPU_KEYWORDS: + if kw in lex: + optional_evidence = True + found_keywords.add(kw) + + # scan summary/description/classifiers/keywords + summary = (info.get("summary") or "") + desc = (info.get("description") or "") + classifiers = "\n".join(info.get("classifiers") or []) + kws = info.get("keywords") or "" + for txt, src in ((summary, "summary"), (desc, "description"), (classifiers, "classifiers"), (kws, "keywords")): + if not txt: + continue + tl = txt.lower() + for kw in GPU_KEYWORDS: + if kw in tl: + found_keywords.add(kw) + if _has_require_phrase(txt): + required_evidence = True + if _has_optional_phrase(txt): + optional_evidence = True + + # optional README scan via repo if requested + if include_repo and meta: + info = meta.get("info", {}) or {} + project_urls = info.get("project_urls") or {} + home = None + if isinstance(project_urls, dict): + home = project_urls.get("Source") or project_urls.get("Homepage") or info.get("home_page") + else: + home = info.get("home_page") + if home and re.search(r"github\.com[:/]+([^/]+)/([^/]+)(?:/|$)", home): + m = re.search(r"github\.com[:/]+([^/]+)/([^/]+)(?:/|$)", home) + owner, repo = m.group(1), m.group(2).rstrip(".git") + for readme_name in README_CANDIDATES: + url = f"https://raw.githubusercontent.com/{owner}/{repo}/HEAD/{readme_name}" + txt = _fetch_raw(url) + if not txt: + continue + t = txt.lower() + for kw in GPU_KEYWORDS: + if kw in t: + found_keywords.add(kw) + if _has_require_phrase(txt): + required_evidence = True + if _has_optional_phrase(txt): + optional_evidence = True + # stop early if we found required evidence + if required_evidence: + break + + # decide level + if required_evidence: + level = "required" + elif optional_evidence: + level = "optional" + elif found_keywords: + level = "unknown" + else: + level = "none" + + return {"level": level, "keywords": sorted(found_keywords), "required_evidence": required_evidence, "optional_evidence": optional_evidence} + + +def main() -> None: + parser = argparse.ArgumentParser(description="Resolve runtime dependency package names for a PyPI package.") + parser.add_argument("package", help="PyPI package name") + parser.add_argument("--version", help="Specific version (optional)", default=None) + parser.add_argument("--include-extras", action="store_true", help="Include extras optional dependencies") + parser.add_argument("--no-repo", action="store_true", help="Do not attempt repository inspection") + args = parser.parse_args() + + deps = 
+    deps = find_dependency_for_pip_package(args.package, args.version, include_extras=args.include_extras, include_repo=not args.no_repo)
+    gpu_info = detect_gpu_requirement(args.package, args.version, include_repo=not args.no_repo)
+    gpu_level = detect_gpu_level(args.package, args.version, include_repo=not args.no_repo)
+    # combine into a single gpu object
+    gpu_obj = {**gpu_info, "classification": gpu_level}
+    out = {"package": args.package, "version": args.version, "dependencies": deps, "gpu": gpu_obj}
+    print(json.dumps(out, indent=2, ensure_ascii=False))
+
+
+if __name__ == "__main__":
+    main()
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/analyse_pypi.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/analyse_pypi.py"
new file mode 100644
index 0000000000000000000000000000000000000000..94f5aeceee6f0c33fe3efa7d3235845329a1fcc7
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/analyse_pypi.py"
@@ -0,0 +1,172 @@
+import argparse
+import json
+from pathlib import Path
+from typing import List, Dict, Optional, Tuple
+import requests
+import sys
+import re
+
+root_dir = Path(__file__).parent.parent
+sys.path.insert(0, str(root_dir))
+
+from info_crawler.github_tools import list_tree, fetch_file
+
+PYPI_API_TEMPLATE = "https://pypi.org/pypi/{name}/json"
+
+def _fetch_pypi_json(package: str, version: Optional[str] = None, timeout: int = 10):
+    url = PYPI_API_TEMPLATE.format(name=package) if version is None else PYPI_API_TEMPLATE.format(name=f"{package}/{version}")
+    return requests.get(url, timeout=timeout)
+
+def check_pypi(candidate_package_name: str, version: Optional[str] = None) -> Tuple[bool, Optional[Dict]]:
+    """Check a single candidate package name on PyPI. If it exists, return (True, package info), else (False, None)."""
+    r = _fetch_pypi_json(candidate_package_name, version)
+    result = r.status_code == 200
+    return result, r.json() if result else None
+
+def check_pypi_list(candidate_package_names: List[str], version: Optional[str] = None) -> Dict[str, bool]:
+    """Check candidate package names on PyPI. Returns a map name -> exists."""
+    result = {}
+    for name in candidate_package_names:
+        result[name], _ = check_pypi(name, version)
+    return result
+
+README_CANDIDATES = [
+    'README.md', 'readme.md', 'README.rst',
+    'readme.rst', 'README.txt', 'README',
+    'README.MD'
+]
+
+# Simplified pip package-name pattern - extracts standard PyPI names only
+PIP_PATTERN = re.compile(
+    r'pip\s+install\s+'  # base command
+    r'(?:--?\w+(?:\s+[^\s-]+)?\s+)*'  # skip long options
+    r'(?:-[a-zA-Z]\s+)*'  # skip short options
+    r"(?:['\"])?([a-zA-Z0-9][a-zA-Z0-9\-_\.]*(?:\[[^\]]+\])?)(?:['\"])?"
# 捕获包名(支持引号和方括号) + r'(?:[><=!~]=?[\d\w\.\-\+]+)*', # 可选版本约束 + re.IGNORECASE +) + +def _parse_pip_commands(text: str) -> list: + """解析pip安装命令,提取标准PyPI包名""" + packages = [] + + # 按行处理,跳过包含特殊安装方式的行 + lines = text.split('\n') + for line in lines: + # 跳过包含这些模式的行:requirements文件、本地安装、git仓库 + if any(pattern in line.lower() for pattern in ['-r ', 'git+', 'install .', 'install -e']): + continue + + # 匹配标准包名 + for match in PIP_PATTERN.finditer(line): + package = match.group(1).strip() + + # 验证包名有效性 + if (package and len(package) > 1 and + not package.endswith('.txt') and + not package.startswith('.') and + not package.startswith('http') and # 排除URL + not package.lower() == 'pip' and # 排除--upgrade pip + not package.lower() == 'poetry' and # 排除安装poetry + not package.lower() == 'uv'): # 排除安装uv + packages.append(package) + + # 去重并保持顺序 + unique_packages = [] + seen = set() + for pkg in packages: + base_pkg = pkg.split('[')[0].lower().replace('-', '_') + if base_pkg not in seen: + unique_packages.append(pkg) + seen.add(base_pkg) + return unique_packages + +def _read_project_name_from_readme(url: str, file_list: List[str]) -> Optional[str]: + for readme_name in README_CANDIDATES: + # find any path in file_list that contains the filename as a substring + match = next((f for f in file_list if readme_name in f), None) + if match: + content = fetch_file(url, match) + if content: + package_list = _parse_pip_commands(content) + if package_list: + return package_list[0] + return None + +def _read_project_name_from_pyproject(url: str, file_list: List[str]) -> Optional[str]: + for p in ['pyproject.toml']: + # find any path in file_list that contains the filename as a substring + match = next((f for f in file_list if p in f), None) + if match: + content = fetch_file(url, match) + if content: + m = re.search(r'name\s*=\s*"([^"]+)"', content) + if m: + return m.group(1) + return None + +def _read_project_name_from_setup_cfg(url: str, file_list: List[str]) -> Optional[str]: + match = next((f for f in file_list if 'setup.cfg' in f), None) + if match: + content = fetch_file(url, match) + if content: + m = re.search(r'^name\s*=\s*(.+)$', content, re.MULTILINE) + if m: + return m.group(1).strip() + return None + +def _read_project_name_from_setup_py(url: str, file_list: List[str]) -> Optional[str]: + match = next((f for f in file_list if 'setup.py' in f), None) + if match: + content = fetch_file(url, match) + if content: + m = re.search(r'name\s*=\s*["\']([^"\']+)["\']', content) + if m: + return m.group(1) + return None + +def detect_candidate_package_names(url: str, repo: str, file_list: List[str]) -> List[str]: + """Try to determine likely PyPI package name(s).""" + names = [] + # explicit files + for fn in (_read_project_name_from_pyproject, _read_project_name_from_setup_cfg, _read_project_name_from_setup_py, _read_project_name_from_readme): + try: + name = fn(url, file_list) + if name: + names.append(name) + except Exception: + continue + # fallback to repo name + if repo and repo not in names: + names.append(repo) + # also consider normalized names (replace _ with -) + for n in list(names): + alt = n.replace('_', '-') + if alt not in names: + names.append(alt) + print(f"Detected candidate package names: {names}") + return names + +def main(): + """Main function to analyze a GitHub repository.""" + repo_url = input("Enter GitHub repository URL: ").strip() + branch = input("Enter branch name (default: HEAD): ").strip() or "HEAD" + + # Fetch file list from the repository + file_list = list_tree(repo_url, branch) + if not 
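+    # A quick illustration of what _parse_pip_commands extracts (hypothetical
+    # README snippet; the -r and git+ lines are skipped by the line filter above):
+    #
+    #     text = '''
+    #     pip install ultralytics
+    #     pip install -r requirements.txt
+    #     pip install git+https://github.com/owner/repo.git
+    #     pip install "onnxruntime[gpu]"
+    #     '''
+    #     _parse_pip_commands(text)  # -> ['ultralytics', 'onnxruntime[gpu]']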
file_list:
+        print("No files found in the repository.")
+        return
+
+    # Detect candidate package names
+    repo_name = repo_url.split('/')[-1]  # Extract repo name from URL
+    candidate_package_names = detect_candidate_package_names(repo_url, repo_name, file_list)
+
+    # Check PyPI for these names
+    pypi_check = check_pypi_list(candidate_package_names)
+
+    print("Candidate package names:", candidate_package_names)
+    print("PyPI check results:", pypi_check)
+
+if __name__ == "__main__":
+    main()
\ No newline at end of file
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/get_abstracts_from_github.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/get_abstracts_from_github.py"
new file mode 100644
index 0000000000000000000000000000000000000000..f0ccd623e9cc4a5b84e2578fe416f16e2b3c93e2
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/get_abstracts_from_github.py"
@@ -0,0 +1,74 @@
+import argparse
+from pathlib import Path
+import requests
+import csv
+import json
+
+# -------------------- Configuration --------------------
+root_dir = Path(__file__).parent.parent
+CONFIG_FILE = f"{root_dir}/config.json"
+API_BASE = "https://api.github.com"
+TOKEN = json.load(open(CONFIG_FILE, 'r'))['github_access_token']
+HEADERS = {
+    "Authorization": f"Bearer {TOKEN}",
+    "Accept": "application/vnd.github+json"
+}
+# --------------------------------------------------------
+
+# CSV configuration (includes a description field)
+CSV_FILENAME = f"{root_dir}/tmp/github_repos_with_desc.csv"
+CSV_HEADERS = ["name", "url", "language", "stars", "description", "updated_at"]
+
+def fetch_repositories(topic: str = "ai"):
+    """Fetch only the first 1000 results."""
+    # Query parameters (filter by topic + star count)
+    query_params = {
+        "q": f"topic:{topic} stars:>=1000",
+        "sort": "stars",
+        "order": "desc",
+        "per_page": 100  # maximum results per page
+    }
+    max_pages = 10  # 1000/100 = 10 pages
+    repos = []
+    for page in range(1, max_pages + 1):
+        print("抓取第 {} 页...".format(page))
+        url = f"{API_BASE}/search/repositories?page={page}"
+        response = requests.get(url, headers=HEADERS, params=query_params)
+        if response.status_code != 200:
+            print(f"请求失败: {response.status_code}")
+            break
+        data = response.json()
+        repos.extend(data["items"])
+        # stop early when fewer than a full page is returned (fewer than 1000 results exist)
+        if len(data["items"]) < query_params["per_page"]:
+            break
+    return repos
+
+def save_to_csv(repos, result_file_name):
+    """Write repositories to a CSV file, handling empty descriptions."""
+    with open(result_file_name, "w", newline="", encoding="utf-8") as csvfile:
+        writer = csv.DictWriter(csvfile, fieldnames=CSV_HEADERS)
+        writer.writeheader()
+        for repo in repos:
+            # fall back to a placeholder when description is empty
+            description = repo["description"].strip() if repo["description"] else "未填写"
+            writer.writerow({
+                "name":
repo["name"], + "url": repo["html_url"], + "language": repo["language"] if repo["language"] else "未标注", + "stars": repo["stargazers_count"], + "description": description, + "updated_at": repo["updated_at"] + }) + +if __name__ == "__main__": + parser = argparse.ArgumentParser(description='抓取GitHub指定Topic的热门仓库') + parser.add_argument('--topic', type=str, default='ai', help='要抓取的GitHub话题') + args = parser.parse_args() + repositories = fetch_repositories(args.topic) + with open(CONFIG_FILE, 'r') as f: + config = json.load(f) + tmp_path = config.get('tmp_path', f"{root_dir}/tmp") + output = f'{tmp_path}/github_{args.topic}_repos_with_desc.csv' + save_to_csv(repositories, output) + print(f"成功导出 {len(repositories)} 条数据至 {output}") \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/get_pypi_name.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/get_pypi_name.py" new file mode 100644 index 0000000000000000000000000000000000000000..2bf693bf790b329ba97166e077ad3c17c185abc3 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/get_pypi_name.py" @@ -0,0 +1,664 @@ +import argparse +import re +import requests +import base64 +import json +import time +import sqlite3 +import os +import sys +from urllib.parse import urlparse +from typing import Tuple, Optional +from datetime import datetime +from pathlib import Path + +root_dir = Path(__file__).parent.parent +sys.path.insert(0, str(root_dir)) +from ai_agent.llm_api import call_llm_api + +class ReadmeAnalyzer: + """GitHub仓库README分析工具""" + + README_CANDIDATES = [ + 'README.md', 'readme.md', 'README.rst', + 'readme.rst', 'README.txt', 'README', + 'README.MD' + ] + + # 简化的pip包名匹配模式 - 只提取标准PyPI包名 + PIP_PATTERN = re.compile( + r'pip\s+install\s+' # 基础命令 + r'(?:--?\w+(?:\s+[^\s-]+)?\s+)*' # 跳过各种参数 + r'(?:-[a-zA-Z]\s+)*' # 跳过短参数 + r"(?:['\"])?([a-zA-Z0-9][a-zA-Z0-9\-_\.]*(?:\[[^\]]+\])?)(?:['\"])?" 
# 捕获包名(支持引号和方括号) + r'(?:[><=!~]=?[\d\w\.\-\+]+)*', # 可选版本约束 + re.IGNORECASE + ) + + # 添加频率控制相关的类变量 + _last_llm_call_time = 0 + _qpm_interval = 60.0 # 默认每分钟1次请求的间隔 + + @classmethod + def extract_repo_info(cls, url: str) -> Tuple[Optional[str], Optional[str]]: + """解析GitHub仓库信息(基于网页8的URL处理逻辑)""" + parsed = urlparse(url) + path_segments = parsed.path.strip('/').split('/') + return (path_segments[0], path_segments[1]) if len(path_segments)>=2 else (None, None) + + @classmethod + def fetch_readme(cls, owner: str, repo: str, token: str = None) -> Optional[str]: + """获取README内容(支持多格式,参考网页3的.rst处理)""" + headers = {'Authorization': f'token {token}'} if token else {} + + for readme_name in cls.README_CANDIDATES: + try: + response = requests.get( + f"https://api.github.com/repos/{owner}/{repo}/contents/{readme_name}", + headers=headers, + timeout=10 + ) + if response.status_code == 200: + print(f"{owner}/{repo} 仓库的readme是{readme_name}") + return base64.b64decode(response.json()['content']).decode('utf-8') + except Exception as e: + print(f"Error fetching {readme_name}: {str(e)}") + + return None + + @classmethod + def parse_pip_commands(cls, text: str) -> list: + """解析pip安装命令,提取标准PyPI包名""" + packages = [] + + # 按行处理,跳过包含特殊安装方式的行 + lines = text.split('\n') + for line in lines: + # 跳过包含这些模式的行:requirements文件、本地安装、git仓库 + if any(pattern in line.lower() for pattern in ['-r ', 'git+', 'install .', 'install -e']): + continue + + # 匹配标准包名 + for match in cls.PIP_PATTERN.finditer(line): + package = match.group(1).strip() + + # 验证包名有效性 + if (package and len(package) > 1 and + not package.endswith('.txt') and + not package.startswith('.') and + not package.startswith('http') and # 排除URL + not package.lower() == 'pip' and # 排除--upgrade pip + not package.lower() == 'poetry' and # 排除安装poetry + not package.lower() == 'uv'): # 排除安装uv + packages.append(package) + + # 去重并保持顺序 + unique_packages = [] + seen = set() + for pkg in packages: + base_pkg = pkg.split('[')[0].lower().replace('-', '_') + if base_pkg not in seen: + unique_packages.append(pkg) + seen.add(base_pkg) + + print(f"找到 {len(unique_packages)} 个PyPI包: {unique_packages}") + return unique_packages + + @classmethod + def analyze_with_llm(cls, readme_content: str, repo_name: str, qpm: int = 1) -> list: + """使用LLM分析README中的pip安装目标(单个仓库)""" + result = cls.analyze_batch_with_llm([{ + 'repo_name': repo_name, + 'readme_content': readme_content + }], qpm) + return result.get(repo_name, []) + + @classmethod + def analyze_batch_with_llm(cls, repo_data_list: list, qpm: int = 1) -> dict: + """使用LLM批量分析多个README中的pip安装目标 + + Args: + repo_data_list: 列表,每个元素是 {'repo_name': str, 'readme_content': str} + qpm: 每分钟请求次数限制 + + Returns: + dict: {repo_name: [packages_list], ...} + """ + # 频率控制:根据QPM限制请求频率 + interval = 60.0 / qpm # 计算每次请求的间隔(秒) + + # 如果不是第一次调用,需要等待 + if cls._last_llm_call_time > 0: + current_time = time.time() + time_since_last_call = current_time - cls._last_llm_call_time + if time_since_last_call < interval: + sleep_time = interval - time_since_last_call + print(f"QPM限制:等待 {sleep_time:.2f} 秒...") + time.sleep(sleep_time) + + # 构建批量分析的prompt + repos_section = "" + for i, repo_data in enumerate(repo_data_list, 1): + repos_section += f""" +=== 仓库 {i}: {repo_data['repo_name']} === +{repo_data['readme_content']} + +""" + + prompt = f""" +请分析以下多个GitHub仓库的README文档,为每个仓库分别提取所有pip install命令中的标准PyPI包名。 + +{repos_section} + +请按以下要求分析: +1. 找出每个仓库README中的所有pip install命令 +2. 只提取标准PyPI包名(忽略requirements文件、本地安装、git仓库等) +3. 保留包名和可选的extras(如package[extra]) +4. 
忽略版本约束,只要包名 + +请以JSON格式返回结果,格式如下: +{{ + "仓库名1": ["package1", "package2[extras]"], + "仓库名2": ["package3", "package4"], + ... +}} + +只返回JSON,不要添加其他说明文字。 +""" + + try: + message = {"role": "user", "content": prompt} + # Build proper chat-style messages list for the LLM API + messages = [message] + # 调用内部的 LLM API; llm_api expects 'messages' to be a list of message dicts + content = call_llm_api(messages, verbose=False) + print(f"LLM批量分析结果: {content}") + + # 尝试解析JSON响应 + try: + # 移除可能的markdown代码块标记 + if content.startswith('```json'): + content = content[7:] + if content.endswith('```'): + content = content[:-3] + + parsed_result = json.loads(content) + print(f"LLM批量分析成功,处理了 {len(parsed_result)} 个仓库") + print(f"处理结果: {parsed_result}") + # 在返回结果后,更新最后调用时间并强制等待完整间隔 + cls._last_llm_call_time = time.time() + print(f"QPM限制:强制等待完整间隔 {interval:.2f} 秒...") + time.sleep(interval) + + return parsed_result + + except json.JSONDecodeError as e: + print(f"解析LLM响应JSON失败: {e}") + print(f"响应内容: {content}") + + # 即使解析失败也要更新时间并等待 + cls._last_llm_call_time = time.time() + print(f"QPM限制:强制等待完整间隔 {interval:.2f} 秒...") + time.sleep(interval) + return {} + + except Exception as e: + print(f"LLM批量分析出错: {str(e)}") + + # 即使出错也要更新时间并等待 + cls._last_llm_call_time = time.time() + print(f"QPM限制:强制等待完整间隔 {interval:.2f} 秒...") + time.sleep(interval) + return {} + +class FileProcessor: + + @staticmethod + def init_database(db_path: str = "repos.db"): + """初始化SQLite数据库""" + conn = sqlite3.connect(db_path) + cursor = conn.cursor() + + # 创建仓库信息表 + cursor.execute(''' + CREATE TABLE IF NOT EXISTS repositories ( + id INTEGER PRIMARY KEY AUTOINCREMENT, + name TEXT NOT NULL, + url TEXT NOT NULL UNIQUE, + owner TEXT, + repo_name TEXT, + stars INTEGER, + last_updated TEXT, + description TEXT, + language TEXT, + pip_packages TEXT, + license_name TEXT, + size_kb INTEGER, + created_at TEXT, + processed_at TEXT DEFAULT CURRENT_TIMESTAMP, + UNIQUE(owner, repo_name) + ) + ''') + + conn.commit() + conn.close() + print(f"数据库已初始化: {db_path}") + + @staticmethod + def fetch_repo_metadata(owner: str, repo: str, token: str = None) -> dict: + """获取GitHub仓库的元数据信息""" + headers = {'Authorization': f'token {token}'} if token else {} + + try: + response = requests.get( + f"https://api.github.com/repos/{owner}/{repo}", + headers=headers, + timeout=10 + ) + + if response.status_code == 200: + data = response.json() + return { + 'stars': data.get('stargazers_count', 0), + 'last_updated': data.get('updated_at', ''), + 'description': data.get('description', ''), + 'language': data.get('language', ''), + 'license_name': data.get('license', {}).get('name', '') if data.get('license') else '', + 'size_kb': data.get('size', 0), + 'created_at': data.get('created_at', '') + } + else: + print(f"获取仓库元数据失败 {owner}/{repo}: HTTP {response.status_code}") + return {} + + except Exception as e: + print(f"获取仓库元数据出错 {owner}/{repo}: {str(e)}") + return {} + + @staticmethod + def save_to_database(name: str, url: str, owner: str, repo: str, + packages: list, metadata: dict = None, + db_path: str = "repos.db"): + """将仓库信息保存到数据库""" + conn = sqlite3.connect(db_path) + cursor = conn.cursor() + + # 准备数据 + packages_str = json.dumps(packages) if packages else '[]' + processed_at = datetime.now().isoformat() + + # 如果没有提供元数据,使用默认值 + if metadata is None: + metadata = {} + + try: + # 使用INSERT OR REPLACE来处理重复数据 + cursor.execute(''' + INSERT OR REPLACE INTO repositories + (name, url, owner, repo_name, stars, last_updated, description, + language, pip_packages, license_name, size_kb, 
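+        # The repositories table written by this method can later be queried
+        # directly, e.g. (illustrative):
+        #
+        #     import sqlite3, json
+        #     conn = sqlite3.connect("repos.db")
+        #     rows = conn.execute(
+        #         "SELECT name, pip_packages FROM repositories WHERE stars >= 1000"
+        #     ).fetchall()
+        #     for name, pkgs in rows:
+        #         print(name, json.loads(pkgs))  # pip_packages is stored as JSON text
+        #     conn.close()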
created_at, processed_at) + VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?) + ''', ( + name, + url, + owner, + repo, + metadata.get('stars', 0), + metadata.get('last_updated', ''), + metadata.get('description', ''), + metadata.get('language', ''), + packages_str, + metadata.get('license_name', ''), + metadata.get('size_kb', 0), + metadata.get('created_at', ''), + processed_at + )) + + conn.commit() + print(f"已保存到数据库: {name} ({owner}/{repo})") + + except sqlite3.Error as e: + print(f"数据库保存失败 {name}: {str(e)}") + finally: + conn.close() + + + @staticmethod + def process_io(input_path: str, output_path: str, token: str = None, + use_llm: bool = False, qpm: int = 1, batch_size: int = 5, + save_to_db: bool = False, db_path: str = "repos.db", + cache_days: int = 30): + """执行批处理流程 - 支持LLM分析选项、批量处理、数据库存储和缓存 + + Args: + cache_days: 缓存有效期(天数),默认30天 + """ + + # 如果启用数据库存储,初始化数据库 + if save_to_db: + FileProcessor.init_database(db_path) + with open(input_path, 'r', encoding='utf-8') as infile: + lines = [line.strip() for line in infile if line.strip()] + + # 开始时清空输出文件 + with open(output_path, 'w', encoding='utf-8') as outfile: + pass # 只是为了清空文件 + + i = 0 + while i < len(lines): + if use_llm and i + batch_size <= len(lines): + # 批量处理 + batch_lines = lines[i:i+batch_size] + batch_data = [] + valid_entries = [] + error_results = [] # 收集错误结果 + + # 收集批次中的有效条目 + for line in batch_lines: + parts = re.split(r'\s+', line, maxsplit=1) + if len(parts) < 2: + error_results.append(f"{line} INVALID_FORMAT []\n") + continue + + name, url = parts[0], parts[1] + owner, repo = ReadmeAnalyzer.extract_repo_info(url) + + if not owner or not repo: + error_results.append(f"{name} {url} INVALID_URL []\n") + continue + + # 检查缓存 + is_cached, cached_data = FileProcessor.check_cache_validity(owner, repo, db_path, cache_days) + if is_cached and cached_data: + # 使用缓存数据直接输出 + packages_list = f"[{', '.join(cached_data['pip_packages'])}]" + cached_result = f"{name} {url} {packages_list}\n" + + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(cached_result) + + print(f"使用缓存数据: {name} ({owner}/{repo}) - 处理时间: {cached_data['processed_at']}") + continue + + try: + content = ReadmeAnalyzer.fetch_readme(owner, repo, token) + if not content: + error_results.append(f"{name} {url} README_NOT_FOUND []\n") + # 即使README未找到,也要保存到数据库避免重复处理 + if save_to_db: + owner_info, repo_info = ReadmeAnalyzer.extract_repo_info(url) + if owner_info and repo_info: + metadata = FileProcessor.fetch_repo_metadata(owner_info, repo_info, token) + FileProcessor.save_to_database(name, url, owner_info, repo_info, + [], metadata, db_path) + continue + + batch_data.append({ + 'repo_name': repo, + 'readme_content': content + }) + valid_entries.append((name, url, repo)) + + except Exception as e: + error_results.append(f"{name} {url} ERROR: {str(e)} []\n") + + # 先写入错误结果 + with open(output_path, 'a', encoding='utf-8') as outfile: + for error_result in error_results: + outfile.write(error_result) + + # 如果有有效的条目,批量调用LLM + if batch_data: + print(f"批量LLM分析 {len(batch_data)} 个仓库的README...") + llm_results = ReadmeAnalyzer.analyze_batch_with_llm(batch_data, qpm) + + # 处理LLM结果并写入文件 + batch_results = [] + for j, (name, url, repo) in enumerate(valid_entries): + packages = llm_results.get(repo, []) + + # 后续处理逻辑保持不变 + unique_packages = [] + seen = set() + for pkg in packages: + base_pkg = pkg.split('[')[0].lower().replace('-', '_') + if base_pkg not in seen: + unique_packages.append(pkg) + seen.add(base_pkg) + + # 不区分大小写匹配仓库名,将匹配的包放在第一位 + repo_name_lower = 
repo.lower().replace('-', '_') + matched_packages = [] + other_packages = [] + + for pkg in unique_packages: + base_pkg = pkg.split('[')[0].lower().replace('-', '_') + if base_pkg == repo_name_lower: + matched_packages.append(pkg) + else: + other_packages.append(pkg) + + # 重新排序:匹配的包在前,其他包在后 + reordered_packages = matched_packages + other_packages + + # 格式化包列表 + packages_list = f"[{', '.join(reordered_packages)}]" + + # 收集结果 + batch_results.append(f"{name} {url} {packages_list}\n") + + # 如果启用数据库存储,保存到数据库 + if save_to_db: + # 获取仓库元数据 + owner, repo_name = ReadmeAnalyzer.extract_repo_info(url) + if owner and repo_name: + metadata = FileProcessor.fetch_repo_metadata(owner, repo_name, token) + FileProcessor.save_to_database(name, url, owner, repo_name, + reordered_packages, metadata, db_path) + + # 将batch结果追加到输出文件 + with open(output_path, 'a', encoding='utf-8') as outfile: + for result in batch_results: + outfile.write(result) + + print(f"已完成批次处理,结果已追加到 {output_path}") + + i += batch_size + + else: + # 单个处理(当不使用LLM或剩余条目不足批次大小时) + line = lines[i] + parts = re.split(r'\s+', line, maxsplit=1) + if len(parts) < 2: + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(f"{line} INVALID_FORMAT []\n") + i += 1 + continue + + name, url = parts[0], parts[1] + owner, repo = ReadmeAnalyzer.extract_repo_info(url) + + if not owner or not repo: + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(f"{name} {url} INVALID_URL []\n") + i += 1 + continue + + # 检查缓存 + is_cached, cached_data = FileProcessor.check_cache_validity(owner, repo, db_path, cache_days) + if is_cached and cached_data: + # 使用缓存数据直接输出 + packages_list = f"[{', '.join(cached_data['pip_packages'])}]" + + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(f"{name} {url} {packages_list}\n") + + print(f"使用缓存数据: {name} ({owner}/{repo}) - 处理时间: {cached_data['processed_at']}") + i += 1 + continue + + try: + content = ReadmeAnalyzer.fetch_readme(owner, repo, token) + if not content: + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(f"{name} {url} README_NOT_FOUND []\n") + + # 即使README未找到,也要保存到数据库避免重复处理 + if save_to_db: + metadata = FileProcessor.fetch_repo_metadata(owner, repo, token) + FileProcessor.save_to_database(name, url, owner, repo, + [], metadata, db_path) + i += 1 + continue + + # 选择分析方法 + if use_llm: + print(f"使用LLM分析 {name} 的README...") + packages = ReadmeAnalyzer.analyze_with_llm(content, repo, qpm) + else: + print(f"使用正则表达式分析 {name} 的README...") + packages = ReadmeAnalyzer.parse_pip_commands(content) + + # 去重处理 + unique_packages = [] + seen = set() + for pkg in packages: + # 处理带extras的包名进行更精确的去重 + base_pkg = pkg.split('[')[0].lower().replace('-', '_') + if base_pkg not in seen: + unique_packages.append(pkg) + seen.add(base_pkg) + + # 不区分大小写匹配仓库名,将匹配的包放在第一位 + repo_name_lower = repo.lower().replace('-', '_') + matched_packages = [] + other_packages = [] + + for pkg in unique_packages: + base_pkg = pkg.split('[')[0].lower().replace('-', '_') + if base_pkg == repo_name_lower: + matched_packages.append(pkg) + else: + other_packages.append(pkg) + + # 重新排序:匹配的包在前,其他包在后 + reordered_packages = matched_packages + other_packages + + # 格式化包列表 + packages_list = f"[{', '.join(reordered_packages)}]" + + # 直接追加到输出文件 + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(f"{name} {url} {packages_list}\n") + + # 如果启用数据库存储,保存到数据库 + if save_to_db: + metadata = FileProcessor.fetch_repo_metadata(owner, repo, token) + 
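+                    # The repo-name-first reordering used in both branches above is
+                    # a standalone heuristic; as a self-contained sketch (hypothetical
+                    # helper name, same logic):
+                    #
+                    #     def put_repo_match_first(packages, repo):
+                    #         key = repo.lower().replace('-', '_')
+                    #         matched = [p for p in packages
+                    #                    if p.split('[')[0].lower().replace('-', '_') == key]
+                    #         others = [p for p in packages if p not in matched]
+                    #         return matched + others
+                    #
+                    #     put_repo_match_first(['numpy', 'opencv-python'], 'opencv-python')
+                    #     # -> ['opencv-python', 'numpy']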
FileProcessor.save_to_database(name, url, owner, repo, + reordered_packages, metadata, db_path) + + except Exception as e: + with open(output_path, 'a', encoding='utf-8') as outfile: + outfile.write(f"{name} {url} ERROR: {str(e)} []\n") + + i += 1 + + + @staticmethod + def check_cache_validity(owner: str, repo: str, db_path: str = "repos.db", cache_days: int = 30) -> tuple: + """检查数据库中是否存在有效的缓存数据 + + Args: + owner: 仓库所有者 + repo: 仓库名 + db_path: 数据库路径 + cache_days: 缓存有效期(天数) + + Returns: + tuple: (is_cached, cached_data) + - is_cached: bool, 是否存在有效缓存 + - cached_data: dict, 缓存的数据(如果存在) + """ + if not os.path.exists(db_path): + return False, None + + conn = sqlite3.connect(db_path) + cursor = conn.cursor() + + try: + cursor.execute(''' + SELECT name, url, owner, repo_name, stars, last_updated, + description, language, pip_packages, license_name, + size_kb, created_at, processed_at + FROM repositories + WHERE owner = ? AND repo_name = ? + ''', (owner, repo)) + + row = cursor.fetchone() + if not row: + return False, None + + # 检查处理时间是否超过缓存期限 + processed_at = row[12] # processed_at字段 + if processed_at: + try: + processed_time = datetime.fromisoformat(processed_at) + current_time = datetime.now() + time_diff = current_time - processed_time + + if time_diff.days <= cache_days: + # 缓存仍然有效,返回缓存数据 + cached_data = { + 'name': row[0], + 'url': row[1], + 'owner': row[2], + 'repo_name': row[3], + 'stars': row[4], + 'last_updated': row[5], + 'description': row[6], + 'language': row[7], + 'pip_packages': json.loads(row[8]) if row[8] else [], + 'license_name': row[9], + 'size_kb': row[10], + 'created_at': row[11], + 'processed_at': row[12] + } + return True, cached_data + except ValueError: + # 如果时间格式解析失败,视为无效缓存 + pass + + return False, None + + except sqlite3.Error as e: + print(f"数据库查询缓存失败: {str(e)}") + return False, None + finally: + conn.close() + +# 使用示例 +if __name__ == "__main__": + parser = argparse.ArgumentParser(description="GitHub仓库获取PyPI包名工具") + parser.add_argument('--input', type=str, default='repos.txt', help='输入文件路径,包含仓库列表') + parser.add_argument('--output', type=str, default='results.txt', help='输出文件路径') + + args = parser.parse_args() + + CONFIG_FILE = f"{root_dir}/config.json" + GITHUB_TOKEN = json.load(open(CONFIG_FILE, 'r'))['github_access_token'] + + # 可以选择使用LLM分析 + USE_LLM = False # 设置为True以启用LLM分析 + QPM = 1 # 每分钟请求次数限制 + BATCH_SIZE = 5 # 批处理大小,默认每次处理5个仓库 + + # 数据库存储选项 + SAVE_TO_DB = False # 设置为True以启用数据库存储 + DB_PATH = "repos.db" # 数据库文件路径 + CACHE_DAYS = 30 # 缓存有效期(天数) + + FileProcessor.process_io(args.input, args.output, GITHUB_TOKEN, + use_llm=USE_LLM, qpm=QPM, batch_size=BATCH_SIZE, + save_to_db=SAVE_TO_DB, db_path=DB_PATH, + cache_days=CACHE_DAYS) + print(f"处理完成!结果已保存至 {args.output}。") \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/github_tools.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/github_tools.py" new file mode 100644 index 
0000000000000000000000000000000000000000..21161fc2c2310776041e1c3f37054c12d7751f35
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/info_crawler/github_tools.py"
@@ -0,0 +1,143 @@
+#!/usr/bin/env python3
+# -*- coding: utf-8 -*-
+"""
+GitHub Repo Spider
+Usage:
+    # public repository
+    python github_spider.py https://github.com/owner/repo
+
+    # private repository (token required)
+    GITHUB_TOKEN=ghp_xxx python github_spider.py https://github.com/owner/private-repo
+
+    # fetch the file tree only
+    python github_spider.py https://github.com/owner/repo --tree-only
+
+    # fetch specific files only
+    python github_spider.py https://github.com/owner/repo --target README.md requirements.txt
+"""
+import sys
+import json
+import base64
+import argparse
+import requests
+from typing import List, Dict, Any
+from pathlib import Path
+
+# -------------------- Configuration --------------------
+root_dir = Path(__file__).parent.parent
+CONFIG_FILE = f"{root_dir}/config.json"
+API_BASE = "https://api.github.com"
+TOKEN = json.load(open(CONFIG_FILE, 'r'))['github_access_token']
+HEADERS = {
+    "Authorization": f"Bearer {TOKEN}",
+    "Accept": "application/vnd.github+json"
+}
+# --------------------------------------------------------
+
+sys.path.insert(0, str(root_dir))
+from logger.logger import get_logger
+
+logger = get_logger("GitHubTools")
+
+def api_get(url: str) -> Dict[str, Any]:
+    """Generic GET wrapper that follows Link-header pagination automatically."""
+    items = []
+    while url:
+        r = requests.get(url, headers=HEADERS, timeout=30)
+        if r.status_code != 200:
+            logger.error(f"{r.status_code} {r.text}")
+            raise Exception(f"GitHub API 请求失败: {r.status_code}")
+        payload = r.json()
+        # a list response may be paginated
+        if isinstance(payload, list):
+            items.extend(payload)
+            link = r.headers.get("Link", "")
+            next_url = None
+            for part in link.split(","):
+                if 'rel="next"' in part:
+                    next_url = part[part.find("<") + 1: part.find(">")]
+            url = next_url
+        else:
+            return payload
+    return items
+
+
+def parse_repo_url(url: str) -> tuple[str, str]:
+    """Extract (owner, repo) from a GitHub URL."""
+    url = url.rstrip("/")
+    if url.endswith(".git"):
+        url = url[:-4]
+    parts = url.split("/")
+    if len(parts) < 5 or parts[2] != "github.com":
+        logger.error("仅支持 https://github.com/owner/repo 格式")
+        raise ValueError("Invalid GitHub URL")
+    return parts[3], parts[4]
+
+
+def list_tree(repo_url: str, branch: str = "HEAD") -> List[str]:
+    """Recursively list all file paths in the repository."""
+    try:
+        owner, repo = parse_repo_url(repo_url)
+        url = f"{API_BASE}/repos/{owner}/{repo}/git/trees/{branch}?recursive=1"
+        data = api_get(url)
+        return [node["path"] for node in data.get("tree", []) if node["type"] == "blob"]
+    except Exception as e:
+        logger.error(f"{e}")
+        return []
+
+
+def fetch_file(repo_url: str, path: str) -> str:
+    """Read file content (base64-decoded automatically)."""
+    try:
+        owner, repo = parse_repo_url(repo_url)
+        url = f"{API_BASE}/repos/{owner}/{repo}/contents/{path}"
+        data = api_get(url)
+        if data.get("encoding") == "base64":
+            return base64.b64decode(data["content"]).decode("utf-8")
+        # non-base64 payloads are returned as-is
+        return data.get("content", "")
+    except Exception as e:
+        logger.error(f"{e}")
+        return ""
+
+
+def main():
+    parser = argparse.ArgumentParser(description="GitHub Repo Spider")
+    parser.add_argument("url", help="仓库地址,例如 https://github.com/owner/repo")
+    parser.add_argument("--tree-only", action="store_true", help="只输出文件树")
+    parser.add_argument("--target", nargs="*",
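+    # Programmatic use of the helpers above (an illustrative sketch; it needs a
+    # valid github_access_token in config.json and network access):
+    #
+    #     paths = list_tree("https://github.com/psf/requests")
+    #     readmes = [p for p in paths if p.lower().startswith("readme")]
+    #     if readmes:
+    #         print(fetch_file("https://github.com/psf/requests", readmes[0])[:200])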
default=["README.md", "requirements.txt"], + help="需要抓取内容的文件名(支持通配符匹配)") + parser.add_argument("--branch", default="HEAD", help="分支或 commit SHA,默认 HEAD") + args = parser.parse_args() + + # 1. 文件树 + tree = list_tree(args.url, args.branch) + print(f"[info] 共发现 {len(tree)} 个文件") + if args.tree_only: + for p in tree: + print(p) + return + + # 2. 抓取目标文件 + out: Dict[str, Any] = {"tree": tree, "files": {}} + for pattern in args.target: + # 支持通配符:README* / *.py + import fnmatch + matched = fnmatch.filter(tree, pattern) + if not matched: + print(f"[warn] 未匹配到 {pattern}") + continue + for file_path in matched: + print(f"[info] 读取 {file_path}") + try: + out["files"][file_path] = fetch_file(args.url, file_path) + except Exception as e: + out["files"][file_path] = f"[error] {e}" + + # 3. 保存结果 + owner, repo = parse_repo_url(args.url) + save_to = f"{owner}_{repo}.json" + with open(save_to, "w", encoding="utf-8") as f: + json.dump(out, f, ensure_ascii=False, indent=2) + print(f"[done] 结果已保存到 {save_to}") + + +if __name__ == "__main__": + main() \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/logger/logger.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/logger/logger.py" new file mode 100644 index 0000000000000000000000000000000000000000..3074ff4ef3b2c3b8a4bc6277184d52651e3e9661 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/logger/logger.py" @@ -0,0 +1,170 @@ +import os +import sys +import json +from typing import Dict, List +import logging +from datetime import datetime +try: + import colorama # type: ignore + colorama.init() + _HAS_COLORAMA = True +except ImportError: + _HAS_COLORAMA = False + +"""终端日志工具""" +_LEVEL2COLOR = { + logging.DEBUG: "\033[36m", # cyan + logging.INFO: "\033[32m", # green + logging.WARNING: "\033[33m", # yellow + logging.ERROR: "\033[31m", # red + logging.CRITICAL: "\033[35m", # magenta +} +_RESET = "\033[0m" + +class _ColoredFormatter(logging.Formatter): + """彩色控制台格式器""" + def format(self, record): + # 仅对 levelname 上色 + if _HAS_COLORAMA or os.name != "nt": + color = _LEVEL2COLOR.get(record.levelno, "") + record.levelname = f"{color}{record.levelname}{_RESET}" + return super().format(record) + + +def _add_handlers(logger: logging.Logger, + console_level: int = logging.INFO, + max_bytes: int = 10 * 1024 * 1024, + backup_count: int = 5): + """给 logger 添加控制台 + 文件 handler""" + # 防止重复 + if logger.handlers: + return + logger.setLevel(logging.DEBUG) # 全局最低 + + # 1) 控制台 + console = logging.StreamHandler(sys.stdout) + console.setLevel(console_level) + console.setFormatter( + _ColoredFormatter( + "[%(asctime)s] [%(name)s] [%(levelname)s] %(message)s", + datefmt="%H:%M:%S" + ) + ) + 
logger.addHandler(console)
+    logger.propagate = False  # avoid duplicate output via propagation to the root logger
+
+
+def get_logger(name: str,
+               console_level: int = logging.INFO,
+               log_dir: str = None,
+               file_level: int = logging.DEBUG):
+    """
+    Get a colored logger.
+    :param name: logger name, usually __name__
+    :param console_level: console log level
+    :param log_dir: log directory; None disables file logging (a file handler is not implemented yet, so this is currently unused)
+    :param file_level: file log level (currently unused, see above)
+    :return: logging.Logger
+    """
+    logger = logging.getLogger(name)
+    _add_handlers(logger, console_level)
+    return logger
+
+# Status values for one package verification run
+class Status:
+    CREATE_ENV_FAILED = "CREATE_ENV_FAILED"
+    INSTALL_FAILED = "INSTALL_FAILED"
+    ENV_RESOLVE_FAILED = "ENV_RESOLVE_FAILED"
+    INCOMPATIBLE = "INCOMPATIBLE"
+    COMPATIBLE = "COMPATIBLE"
+    OTHER_ERROR = "OTHER_ERROR"
+
+"""Result-file generation utilities"""
+class ResultLogger:
+    """Result recorder"""
+
+    def __init__(self, output_dir: str = "../results"):
+        self.logger = get_logger("日志记录")
+        self.output_dir = output_dir
+        os.makedirs(output_dir, exist_ok=True)
+
+    def save_results(self, package_name: str, results: Dict) -> str:
+        """Save test results to a JSON file."""
+        filename = f"{package_name}_{datetime.now().strftime('%Y%m%d_%H%M%S')}.json"
+        filepath = os.path.join(self.output_dir, filename)
+
+        with open(filepath, 'w', encoding='utf-8') as f:
+            json.dump(results, f, indent=2, ensure_ascii=False)
+
+        self.logger.info(f"结果已保存到: {filepath}")
+        return filepath
+
+    def generate_summary_report(self, all_results: List[Dict], exists_packages_num: int, total_packages_num: int) -> Dict:
+        """
+        Generate a summary report.
+
+        Arguments:
+        - all_results: list of recorded results (one package may have several records)
+        - exists_packages_num: number of packages that exist on PyPI
+        - total_packages_num: total number of packages
+        Returns: the summary report dict
+        """
+        summary = {
+            'total_packages': total_packages_num,
+            'not_found_packages': total_packages_num - exists_packages_num,
+            'total_exists_packages': exists_packages_num,
+            'successful_packages': 0,
+            'install_failed_packages': 0,
+            'create_env_failed_packages': 0,
+            'env_resolve_failed_packages': 0,
+            'verify_failed_packages': 0,
+            # (total_exists_packages - install_failed_packages) / total_exists_packages
+            'install_rate': 0.0,
+            # successful_packages / total_exists_packages
+            'compatibility_rate': 0.0,
+            # (total_packages - install_failed_packages) / total_packages
+            'install_rate_total': 0.0,
+            # successful_packages / total_packages
+            'compatibility_rate_total': 0.0,
+            'details': [],
+            'timestamp': datetime.now().isoformat()
+        }
+
+        for result in all_results:
+            if result.get('status') == Status.COMPATIBLE:
+                summary['successful_packages'] += 1
+            elif result.get('status') == Status.INSTALL_FAILED:
+                summary['install_failed_packages'] += 1
+            elif result.get('status') == Status.CREATE_ENV_FAILED:
+                summary['create_env_failed_packages'] += 1
+            elif result.get('status') == Status.ENV_RESOLVE_FAILED:
+                summary['env_resolve_failed_packages'] += 1
+            elif result.get('status') == Status.INCOMPATIBLE:
+                summary['verify_failed_packages'] += 1
+            else:
+                self.logger.error(f"未知的结果状态: {result}")
+
+            summary['details'].append({
+                'package_name': result['package_name'],
+                'status': result.get('status', 'UNKNOWN'),
+                'test_summary': result.get('summary', {})
+            })
+
+        if total_packages_num > 0:
+            summary['install_rate_total'] = (total_packages_num - summary['install_failed_packages']) / total_packages_num
+            summary['compatibility_rate_total'] = summary['successful_packages'] / total_packages_num
+        if exists_packages_num > 0:
+            summary['install_rate'] = (exists_packages_num - summary['install_failed_packages']) / exists_packages_num
+            summary['compatibility_rate'] =
summary['successful_packages'] / exists_packages_num + # 保存汇总报告 + summary_file = os.path.join(self.output_dir, f"summary_{datetime.now().strftime('%Y%m%d_%H%M%S')}.json") + with open(summary_file, 'w', encoding='utf-8') as f: + json.dump(summary, f, indent=2, ensure_ascii=False) + + self.logger.info(f"汇总报告已保存到: {summary_file}") + return summary + +if __name__ == "__main__": + result_logger = ResultLogger() + result_logger.save_results("example_package", {"status": "COMPATIBLE"}) + result_logger.generate_summary_report([ + {'package_name': 'pkg1', 'status': Status.COMPATIBLE}, + {'package_name': 'pkg2', 'status': Status.INSTALL_FAILED}, + {'package_name': 'pkg3', 'status': Status.INCOMPATIBLE} + ], exists_packages_num=3, total_packages_num=5) \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/configuration.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/configuration.py" new file mode 100644 index 0000000000000000000000000000000000000000..7ec530d7ef414a5f2458ecefdd0a0c6c857c75ff --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/configuration.py" @@ -0,0 +1,100 @@ +import json +import os +from typing import Any, Optional + +import dotenv + + +class Configuration: + """Manages configuration and environment variables for the MCP client.""" + + def __init__(self) -> None: + """Initialize configuration with environment variables.""" + self.load_env() + self._llm_api_key = os.getenv("LLM_API_KEY") + self._llm_base_url = os.getenv("LLM_BASE_URL") + self._llm_model_name = os.getenv("LLM_MODEL_NAME") + + self._ollama_model_name = os.getenv("OLLAMA_MODEL_NAME") + self._ollama_base_url = os.getenv("OLLAMA_BASE_URL") + + @staticmethod + def load_env() -> None: + """Load environment variables from .env file.""" + dotenv.load_dotenv() + + @staticmethod + def load_config(file_path: str) -> dict[str, Any]: + """Load server configuration from JSON file. + + Args: + file_path: Path to the JSON configuration file. + + Returns: + Dict containing server configuration. + + Raises: + FileNotFoundError: If configuration file doesn't exist. + JSONDecodeError: If configuration file is invalid JSON. + """ + with open(file_path, "r") as f: + return json.load(f) + + @property + def llm_api_key(self) -> str: + """Get the LLM API key. + + Returns: + The API key as a string. + + Raises: + ValueError: If the API key is not found in environment variables. + """ + if not self._llm_api_key: + raise ValueError("LLM_API_KEY not found in environment variables") + return self._llm_api_key + + @property + def llm_base_url(self) -> Optional[str]: + """Get the LLM base URL. 
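+
+        All of these values come from a .env file loaded by load_env(); an
+        illustrative .env (variable names as read in __init__, values invented):
+
+            LLM_API_KEY=sk-xxxx
+            LLM_BASE_URL=https://api.example.com/v1
+            LLM_MODEL_NAME=your-model-name
+            OLLAMA_MODEL_NAME=your-ollama-model
+            OLLAMA_BASE_URL=http://localhost:11434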
+
+        Returns:
+            The base URL as a string.
+        """
+        return self._llm_base_url
+
+    @property
+    def llm_model_name(self) -> str:
+        """Get the LLM model name.
+
+        Returns:
+            The model name as a string.
+
+        Raises:
+            ValueError: If the model name is not found in environment variables.
+        """
+        if not self._llm_model_name:
+            raise ValueError("LLM_MODEL_NAME not found in environment variables")
+        return self._llm_model_name
+
+    @property
+    def ollama_model_name(self) -> str:
+        """Get the Ollama model name.
+
+        Returns:
+            The model name as a string.
+
+        Raises:
+            ValueError: If the model name is not found in environment variables.
+        """
+        if not self._ollama_model_name:
+            raise ValueError("OLLAMA_MODEL_NAME not found in environment variables")
+        return self._ollama_model_name
+
+    @property
+    def ollama_base_url(self) -> str:
+        """Get the Ollama base URL.
+
+        Returns:
+            The base URL as a string.
+
+        Raises:
+            ValueError: If the base URL is not found in environment variables.
+        """
+        if not self._ollama_base_url:
+            raise ValueError("OLLAMA_BASE_URL not found in environment variables")
+        return self._ollama_base_url
diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_client.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_client.py"
new file mode 100644
index 0000000000000000000000000000000000000000..1dc54beb232d13a53177ed3a7e950c3c716df6aa
--- /dev/null
+++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_client.py"
@@ -0,0 +1,144 @@
+import asyncio
+import os
+import sys
+import shutil
+from pathlib import Path
+from contextlib import AsyncExitStack
+from typing import Any, List
+
+from mcp import ClientSession, StdioServerParameters
+from mcp.client.stdio import stdio_client
+
+
+root_dir = Path(__file__).parent.parent
+sys.path.insert(0, str(root_dir))
+
+from logger.logger import get_logger
+from mcp_chat_bot.mcp_tool import MCPTool
+
+class MCPClient:
+    """MCPClient manages the connection to one MCP server."""
+
+    def __init__(self, name: str, config: dict[str, Any]) -> None:
+        self.name: str = name
+        self.config: dict[str, Any] = config
+        self.stdio_context: Any | None = None
+        self.session: ClientSession | None = None
+        self._cleanup_lock: asyncio.Lock = asyncio.Lock()
+        self.exit_stack: AsyncExitStack = AsyncExitStack()
+        self.logger = get_logger(f"MCPClient-{self.name}")
+
+    async def initialize(self) -> None:
+        """Initialize the server connection."""
+        command = (
+            shutil.which("npx")
+            if self.config["command"] == "npx"
+            else self.config["command"]
+        )
+        server_params = StdioServerParameters(
+            command=command,
+            args=self.config["args"],
+            env={**os.environ, **self.config["env"]}
+            if self.config.get("env")
+            else None,
+        )
+        try:
+            stdio_transport = await self.exit_stack.enter_async_context(
+                stdio_client(server_params)
+            )
+            read, write
= stdio_transport + session = await self.exit_stack.enter_async_context( + ClientSession(read, write) + ) + await session.initialize() + self.session = session + except Exception as e: + self.logger.error(f"Error initializing server {self.name}: {e}") + await self.cleanup() + raise + + async def list_tools(self) -> List[MCPTool]: + """List available tools from the server. + + Returns: + A list of available tools. + + Raises: + RuntimeError: If the server is not initialized. + """ + if not self.session: + raise RuntimeError(f"Server {self.name} not initialized") + + tools_response = await self.session.list_tools() + tools = [] + + for item in tools_response: + if isinstance(item, tuple) and item[0] == "tools": + for tool in item[1]: + tools.append(MCPTool(tool.name, tool.description, tool.inputSchema)) + + return tools + + async def execute_tool( + self, + tool_name: str, + arguments: dict[str, Any], + retries: int = 2, + delay: float = 1.0, + ) -> Any: + """Execute a tool with retry mechanism. + + Args: + tool_name: Name of the tool to execute. + arguments: Tool arguments. + retries: Number of retry attempts. + delay: Delay between retries in seconds. + + Returns: + Tool execution result. + + Raises: + RuntimeError: If server is not initialized. + Exception: If tool execution fails after all retries. + """ + if not self.session: + raise RuntimeError(f"Server {self.name} not initialized") + + attempt = 0 + while attempt < retries: + try: + self.logger.info(f"Executing {tool_name}...") + result = await self.session.call_tool(tool_name, arguments) + + return result + + except Exception as e: + attempt += 1 + self.logger.warning( + f"Error executing tool: {e}. Attempt {attempt} of {retries}." + ) + if attempt < retries: + self.logger.info(f"Retrying in {delay} seconds...") + await asyncio.sleep(delay) + else: + self.logger.error("Max retries reached. 
Failing.") + raise + + async def cleanup(self) -> None: + """Clean up server resources.""" + async with self._cleanup_lock: + try: + await self.exit_stack.aclose() + self.session = None + self.stdio_context = None + except Exception as e: + self.logger.error(f"Error during cleanup of server {self.name}: {e}") + + async def __aenter__(self): + """Enter the async context manager.""" + await self.initialize() + return self + + async def __aexit__(self, exc_type, exc_val, exc_tb): + """Exit the async context manager.""" + await self.cleanup() diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/dependency_analyst.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/dependency_analyst.py" new file mode 100644 index 0000000000000000000000000000000000000000..9a583087ffe7031f3182c55d59f4cc7efc71ea65 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/dependency_analyst.py" @@ -0,0 +1,43 @@ +import sys +from pathlib import Path +from typing import List + +from mcp.server.fastmcp import FastMCP + +root_dir = Path(__file__).parent.parent.parent +sys.path.insert(0, str(root_dir)) + +from info_crawler.analyse_dependency import find_dependency_for_pip_package, detect_gpu_requirement + +mcp = FastMCP("Dependency Analyse Tool") + +@mcp.tool() +def find_dependency_for_pip_package_mcp(package: str, version: str = None, include_extras: bool = False) -> List[str]: + """Analyze the dependency of a given PyPI package. + + Args: + package: The name of the PyPI package. + version: The version of the package (optional). + include_extras: Whether to include extra dependencies (default: False). + + Returns: + A list of dependencies for the specified package and version. + """ + return find_dependency_for_pip_package(package, version, include_extras) + +@mcp.tool() +def detect_gpu_requirement_for_pip_package_mcp(package: str, version: str = None) -> bool: + """Detect the GPU requirement of a given PyPI package. + + Args: + package: The name of the PyPI package. + version: The version of the package (optional). + + Returns: + A boolean indicating whether the package requires a GPU(Even the GPU is optional, the result still be true). 
+ """ + return detect_gpu_requirement(package, version)['gpu'] + +if __name__ == "__main__": + # Initialize and run the server + mcp.run() \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/github_analyst.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/github_analyst.py" new file mode 100644 index 0000000000000000000000000000000000000000..8cd15e4bf0db9e7eef76ad41407bb97314d6e32a --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/github_analyst.py" @@ -0,0 +1,49 @@ +import sys +from pathlib import Path +from typing import List + +from mcp.server.fastmcp import FastMCP + +root_dir = Path(__file__).parent.parent.parent +sys.path.insert(0, str(root_dir)) + +from info_crawler.github_tools import list_tree, fetch_file + +# Create a Simple MCP Server +mcp = FastMCP("Github Analyse Tool") + +@mcp.tool() +def find_files_mcp(repo_url: str, pattern: str, branch: str = 'HEAD') -> List[str]: + """Find files matching a pattern in a GitHub repository. + + Args: + repo_url: The URL of the GitHub repository. + pattern: The pattern to match file paths against (e.g., 'README.md' for introduction files). + branch: The branch or commit SHA, default value is HEAD (Optional). + + Returns: + A list of file paths that match the given pattern. + """ + tree = list_tree(repo_url, branch) + import fnmatch + + matched_files = [f for f in tree if fnmatch.fnmatch(f, pattern)] + return matched_files + +@mcp.tool() +def fetch_file_mcp(repo_url: str, path: str) -> str: + """Fetch file content from a GitHub repository. + + Args: + repo_url: The URL of the GitHub repository. + path: The path to the file in the repository. + + Returns: + The content of the file as a string. 
+ """ + content = fetch_file(repo_url, path) + return content + +if __name__ == "__main__": + # Initialize and run the server + mcp.run() \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/mcp_servers_config.json" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/mcp_servers_config.json" new file mode 100644 index 0000000000000000000000000000000000000000..2a46857dc3085219d55048a938f5874a7c517615 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/mcp_servers_config.json" @@ -0,0 +1,32 @@ +{ + "mcpServers": { + "github_analyst": { + "command": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/.main_venv/bin/python", + "args": [ + "-u", + "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/github_analyst.py" + ] + }, + "pypi_analyst": { + "command": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/.main_venv/bin/python", + "args": [ + "-u", + "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/pypi_analyst.py" + ] + }, + "dependency_analyst": { + "command": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/.main_venv/bin/python", + "args": [ + "-u", + "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/dependency_analyst.py" + ] + }, + "test_executor": { + "command": "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/.main_venv/bin/python", + "args": [ + "-u", + "/root/contributor_rhino-bird/2025实战任务_作品文件夹/OpenCloudOS 9 AI软件自动化验证工具/黄振业_作品/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/test_executor.py" + ] + } + } +} \ No newline at end of file diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/pypi_analyst.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 
AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/pypi_analyst.py" new file mode 100644 index 0000000000000000000000000000000000000000..9dc6d4325e6c877f0d99248c1e721b4005f8fe54 --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/pypi_analyst.py" @@ -0,0 +1,79 @@ +import sys +from pathlib import Path +from typing import Any, Dict, List, Optional + +from mcp.server.fastmcp import FastMCP + +root_dir = Path(__file__).parent.parent.parent +sys.path.insert(0, str(root_dir)) + + +from info_crawler.analyse_pypi import check_pypi, check_pypi_list, detect_candidate_package_names +from info_crawler.github_tools import list_tree + +mcp = FastMCP("PyPI Analysis Tool") + +@mcp.tool() +def check_pypi_mcp(candidate_package_name: str) -> Dict[str, Any]: + """Check if a candidate package name exists on PyPI. + + Args: + candidate_package_name: The candidate package name to check. + + Returns: + A dictionary containing: + - 'exists': True if the package exists on PyPI, False otherwise. + - 'info': The package information if it exists, empty otherwise. + """ + exists, info = check_pypi(candidate_package_name) + # Build the metadata summary only when PyPI returned package info; otherwise fall back to an empty dict. + if info: + analysis = { + "name": info.get("name"), + "version": info.get("version"), + "summary": info.get("summary"), + "author": info.get("author"), + "license": info.get("license"), + "home_page": info.get("home_page"), + "requires_python": info.get("requires_python"), + "dependencies": info.get("requires_dist", []), + "keywords": info.get("keywords"), + "classifiers": info.get("classifiers", []), + "project_urls": info.get("project_urls", {}), + "upload_time": info.get("upload_time"), + "yanked": info.get("yanked", False), + } + return {"exists": exists, "info": analysis if info else {}} + +@mcp.tool() +def check_pypi_list_mcp(candidate_package_names: List[str]) -> Dict[str, bool]: + """Check if candidate package names exist on PyPI. + + Args: + candidate_package_names: List of candidate package names to check. + + Returns: + A dictionary mapping each candidate package name to its existence status on PyPI (True if the package exists, False otherwise). + """ + return check_pypi_list(candidate_package_names) + + +@mcp.tool() +def try_detect_candidate_package_names_mcp(repo_url: str, file_list: Optional[List[str]] = None) -> List[str]: + """Try to detect likely PyPI package names from GitHub repository files. Only repository URLs of the form 'https://github.com/owner/repo' are supported. + + Args: + repo_url: The URL of the GitHub repository. + file_list: List of file paths in the repository (optional). + + Returns: + A list of detected candidate package names. 
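+ Note: the names are heuristic guesses derived from repository files and are not guaranteed to exist on PyPI; confirm them with check_pypi_mcp or check_pypi_list_mcp before use.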
+ """ + repo_name = repo_url.split('/')[-1] # Extract repo name from URL + if file_list is None: + file_list = list_tree(repo_url) + return detect_candidate_package_names(repo_url, repo_name, file_list) + +if __name__ == "__main__": + # Initialize and run the server + mcp.run() diff --git "a/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/test_executor.py" "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/test_executor.py" new file mode 100644 index 0000000000000000000000000000000000000000..382c9eadf04d24b28ae2fdeed1ddec799c578f5c --- /dev/null +++ "b/2025\345\256\236\346\210\230\344\273\273\345\212\241_\344\275\234\345\223\201\346\226\207\344\273\266\345\244\271/OpenCloudOS 9 AI\350\275\257\344\273\266\350\207\252\345\212\250\345\214\226\351\252\214\350\257\201\345\267\245\345\205\267/\351\273\204\346\214\257\344\270\232_\344\275\234\345\223\201/oc_contributor_huangzhenye/code/mcp_chat_bot/mcp_servers/test_executor.py" @@ -0,0 +1,173 @@ +import asyncio +import sys +import json +import time +from pathlib import Path +from typing import Any, Dict, List, Optional, Union + +from mcp.server.fastmcp import FastMCP + +root_dir = Path(__file__).parent.parent.parent +sys.path.insert(0, str(root_dir)) + +from package_manager.package_tester import TestExecutionEngine +from package_manager.environment_resolver import EnvironmentResolver +from package_manager.package_installer import PackageInstaller +from utils.create_venv import create_venv + +CONFIG_PATH = f'{root_dir}/config.json' +with open(CONFIG_PATH, 'r') as f: + config = json.load(f) + VENVS_DIR = config['venvs_path'] + +mcp = FastMCP("Test Execution Error Resolution Tool") + +@mcp.tool() +async def execute_test_case_mcp(package_name: str, venv_name: str, test_type: str, test_case: str, expected_result: str) -> Dict: + """Execute a test case in a specified virtual environment and compare the normalized output with the normalized expected result. + + Args: + package_name: The name of the package being tested. + venv_name: The name of the virtual environment where the test case will be executed. + test_type: The type of the test including "import test", "functional test" and "gpu test". + test_case: The test case code. + expected_result: The expected test-case result, as determined by the regex. + + Returns: + A dictionary like that: + { + "test_type": "", + "test_case": "", + "status": "PASS" or "FAIL", + "actual_output": "", + "expected_output": "", + "stderr": "", + "return_code": , + "execution_time": "