浅析ado.net获取数据库元数据信息-白红宇的个人博客

发布日期：2021-06-30 19:07:46 浏览次数：2 分类：技术文章

本文共 5569 字，大约阅读时间需要 18 分钟。

写这个文章源于早先对ADO.Net获取数据库元数据上的认识，去年我在阅读ADO.Net Core Reference的时候曾经注意过DataSet的FillSchema的这个方法。这方面，在我之前的随笔中提到过Typed DataSet，而FillSchem与WriteXmlSchema的结合使用可以获得数据库的表结构架构，从而使用相应工具生成强类型的DataSet。但是我记得作者建议在具体应用开发中尽量少用FillSchema这个方法，因为出于性能考虑，其一般只适合作为测试过程中的一个方法。

当时我的理解就是，这是一个获取数据库元数据的一个方便的方法，但是由于其对性能的影响，因此通常应用中比较少用。而在我后面的开发中也未曾有机会接触这个方法。

今年早先1月份的时候看DAAB，注意到其封装的DataCommand对象提供了动态获取存储过程信息的支持：DeriveParameters。当时我的第一印象是，这也是获取数据库的“元数据”，因为之前有过FillSchema对性能影响上的认识，我当时就产生了一个问号：这样做适合吗？自动填充Command对象的Parameter集合，会影响应用程序的性能吗？

就此我也请教过M$的专家，给我的回答是两者机制不同，后者对性能影响不大。

昨日翻倒年初对这个问题疑惑而提的一篇帖子，突然很想进一步找找这两中方法的区别之处，简单了解了一下，以下做个简单的归纳。

DeriveParameters方法

先说简单的一个。DeriveParameters是SqlCommandBuilder类的一个公共方法，提供一个SqlCommannd的参数，该Command对象作为获取到的Parameters的存放容器。其实SqlCommand本身就有一个DeriveParameters的方法，但是它是内部方法，而SqlCommandBuilder.DeriveParameters就是封装了该方法的调用：

public

static

void

DeriveParameters(SqlCommand command)

{

SqlConnection.SqlClientPermission.Demand();

if (command == null)

{

// throw an exception

}

command.DeriveParameters();

}

来看一下SqlCommand的DeriveParameters方法：

internal

void

DeriveParameters()

{

// Validate command type(is storedprocedure?) and command info

// Retrieve command text detail

string[] txtCommand = ADP.ParseProcedureName(this.CommandText);

SqlCommand cmdDeriveCommand = null;

this.cmdText = "sp_procedure_params_rowset";

if (txtCommand[1] != null)

{

this.cmdText = "[" + txtCommand[1] + "].." + this.cmdText;

if (txtCommand[0] != null)

{

this.cmdText = txtCommand[0] + "." + this.cmdText;

}

cmdDeriveCommand = new SqlCommand(this.cmdText, this.Connection);

}

else

{

cmdDeriveCommand = new SqlCommand(this.cmdText, this.Connection);

}

cmdDeriveCommand.CommandType = CommandType.StoredProcedure;

cmdDeriveCommand.Parameters.Add(new SqlParameter("@procedure_name", SqlDbType.NVarChar, 0xff));

cmdDeriveCommand.Parameters[0].Value = txtCommand[3];

ArrayList parms = new ArrayList();

try

{

try

{

using (SqlDataReader drParam = cmdDeriveCommand.ExecuteReader())

{

SqlParameter parameter = null;

while (drParam.Read())

{

parameter = new SqlParameter();

parameter.ParameterName = (string) drParam["PARAMETER_NAME"];

parameter.SqlDbType = MetaType.GetSqlDbTypeFromOleDbType((short) drParam["DATA_TYPE"], (string) drParam["TYPE_NAME"]);

object len = drParam["CHARACTER_MAXIMUM_LENGTH"];

if (len is int)

{

parameter.Size = (int) len;

}

parameter.Direction = this.ParameterDirectionFromOleDbDirection((short) drParam["PARAMETER_TYPE"]);

if (parameter.SqlDbType == SqlDbType.Decimal)

{

parameter.Scale = (byte) (((short) drParam["NUMERIC_SCALE"]) & 0xff);

parameter.Precision = (byte) (((short) drParam["NUMERIC_PRECISION"]) & 0xff);

}

parms.Add(parameter);

}

finally

{

cmdDeriveCommand.Connection = null;

}

catch

{

throw;

}

if (params.Count == 0)

{

// throw an exception that current storedprocedure does not exist

}

this.Parameters.Clear();

foreach (object parm in parms)

{

this._parameters.Add(parm);

}

ADP.ParseProcedureName其实就是获取存储过程命令的细节信息，有兴趣的可以反编译来看看。

纵观整个方法，有效性验证-〉获取命令字符串-〉执行查询-〉填充参数列表-〉返回。应该是非常简洁明朗的，最多也就是在数据库Query的阶段需要有一个来回，其他操作根本就谈不上有什么复杂度，而且也不存在大数据的对象，对性能的损耗谈不上多巨大。

下面来看看FillSchema的处理过程

FillSchema方法

这个部分因为代码比较多，所以我就抽关键的部分来看一下。

首先，FillSchema是DataAdapter类定义的一个方法，而具体实现则是在该类的子类DBDataAdapter中完成的（SqlDataAdapter继承于DBDataAdapter）。

通过反编译，可以发现FillSchema的关键处理步骤是在其调用私有方法FillSchemaFromCommand来完成的。简单看一下该方法体的内容：

private

DataTable[] FillSchemaFromCommand(

object

data, SchemaType schemaType, IDbCommand command,

string

srcTable, CommandBehavior behavior)

{

IDbConnection connection = DbDataAdapter.GetConnection(command, "FillSchema");

ConnectionState state = ConnectionState.Open;

DataTable[] arrTables = new DataTable[0];

try

{

try

{

DbDataAdapter.QuietOpen(connection, out state);

using (IDataReader reader = command.ExecuteReader((behavior | CommandBehavior.SchemaOnly) | CommandBehavior.KeyInfo))

{

if (reader == null)

{

return arrTables;

}

int tblIndex = 0;

while (true)

{

if (0 < reader.FieldCount)

{

try

{

string txtTableName = null;

SchemaMapping mapping = new SchemaMapping(this, reader, true);

if (data is DataTable)

{

mapping.DataTable = (DataTable) data;

}

else

{

mapping.DataSet = (DataSet) data;

txtTableName = DbDataAdapter.GetSourceTableName(srcTable, tblIndex);

}

mapping.SetupSchema(schemaType, txtTableName, false, null, null);

DataTable currentTable = mapping.DataTable;

if (currentTable != null)

{

arrTables = DbDataAdapter.AddDataTableToArray(arrTables, currentTable);

}

finally

{

tblIndex++;

}

if (!reader.NextResult())

{

return arrTables;

}

finally

{

DbDataAdapter.QuietClose(connection, state);

}

catch

{

throw;

}

return arrTables;

}

首先，该操作含有一个数据库的Query操作，这里其实是调用DBDataAdapter的SelectCommand的对象，执行一次查询，然后遍历查询返回的所有表，每遍历到一个表的时候，通过该表的信息实例化一个SchemaMapping对象，再有该对象创建为DataSet/DataTable创建架构信息。

这里，DataSet/DataTable是作为参数提供的，整个处理过程，首先必然的需要完成一次查询操作，由于使用IDataReader，所以在查询之后的所有操作期间，连接是保持着的，这一定程度上占用了一些资源（也可以说这些资源还不算太昂贵）；其次，实例化一个SchemaMapping对象（该对象是内部类，我在MSDN上没有查到相关介绍性资料），我简单看了一下这个类的代码，在我看来，它的处理过程应该是占据了整个过程蛮大一部分资源的，这方面属于个人见解。

由于我的认识上的有限，也为了保证文章的内容无误导，暂且说到这里。这个方法的进一步讨论希望留给有兴趣的朋友。

总结

以上是我对这两个方法认识方面简单的一个概括，其实从上面的描述，也打消了我原先认为的这两个方法在获取元数据上有本质的差别。个人认为，之所以获取结构性元数据的消耗大，是因为获取逻辑的繁琐以及使用的对象的庞大，而参数信息相对而言完全属于轻量级的东西，所以所谓性能上的差异并非因为获取机制的本质差异引起的。

转载地址：https://linuxstyle.blog.csdn.net/article/details/1536921 如侵犯您的版权，请留言回复原文章的地址，我们会给您删除此文章，给您带来不便请您谅解！

上一篇：按拼音模糊匹配查询条件的生成类

下一篇：将Excel文件数据库导入SQL Server

发表评论

关于作者

喝酒易醉，品茶养心，人生如梦，品茶悟道，何以解忧？唯有杜康！

-- 愿君每日到此一游！

发表评论

最新留言

关于作者

推荐文章