WO2016127880A1 - Method and device for determining quality of offline positioning data - Google Patents

Method and device for determining quality of offline positioning data

Info

Publication number
WO2016127880A1
Authority
WO
WIPO (PCT)
Prior art keywords
positioning
data
offline
coverage
success rate
Prior art date
Application number
PCT/CN2016/073271
Other languages
French (fr)
Chinese (zh)
Inventor
吴兴昊
刘欣
房宇星
邵茂材
张杨
林海涛
Original Assignee
阿里巴巴集团控股有限公司
吴兴昊
刘欣
房宇星
邵茂材
张杨
林海涛
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司, 吴兴昊, 刘欣, 房宇星, 邵茂材, 张杨, 林海涛
Publication of WO2016127880A1

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/02 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using radio waves
    • G01S5/0205 Details
    • G01S5/0244 Accuracy or reliability of position solution or of measurements contributing thereto

Definitions

  • the present invention relates to the field of computer application technologies, and in particular, to a method and apparatus for determining the quality of offline positioning data.
  • GPS positioning requires the mobile terminal to have a GPS module, and the mobile device often cannot be located when the GPS signal is blocked.
  • In base station positioning, the mobile device sends the base station ID (identification) to the server through a network request; the server determines the location of the mobile device according to the latitude and longitude information of the base station in its database, and sends that location to the mobile device through the network.
  • Base station positioning requires the mobile device to access the network every time it locates itself. It is an online positioning (also called network positioning) mode, which consumes network traffic and cannot locate the device without a network.
  • Offline positioning data is the premise of offline positioning, and its quality directly affects the quality of offline positioning.
  • However, there is no existing method for determining the quality of offline positioning data, so the quality of offline positioning cannot be measured and the optimization of offline positioning data cannot be guided.
  • the present invention provides a method and apparatus for determining the quality of offline positioning data, so as to be able to measure the quality of offline positioning, thereby being able to guide the optimization of offline positioning data.
  • the present invention provides a method for determining the quality of offline location data, the method comprising:
  • the determining the coverage rate of the offline positioning data includes: counting the total number of records X in the set time period in the online positioning log, counting the number Y of records in the set time period in the online positioning log whose LAC identification information exists in the offline positioning database, and using the ratio of Y to X to determine the coverage rate of the offline positioning data.
  • the determining the success rate of the offline positioning data includes: counting the total number of records X in the set time period in the online positioning log, counting the number Z of positioning results in the online positioning log in the set time period that are consistent with the offline positioning data, and using the ratio of Z to X to determine the success rate of the offline positioning data.
  • the LAC data includes: LAC identification information and corresponding location information.
  • the LAC data is obtained by combining base station data according to the LAC, and the base station data includes a base station identifier, a LAC identifier of the base station, and location information of the base station.
  • the merging the base station data according to the LAC includes:
  • Base station data belonging to the same LAC is clustered based on location information to obtain at least one class
  • the identification information of the LAC and the location information of the center point of the corresponding class, as well as the contour information or the radius information are recorded.
  • the counting of the number Z of positioning results in the online positioning log in the set time period that are consistent with the offline positioning data includes:
  • the comparison operation is performed for all the positioning results in the online positioning log in the set time period, and the number of positioning results finally counted as consistent is Z.
  • when a consistent positioning result is counted, the positioning accuracy information of the position information in the offline positioning data that is consistent with the positioning result is further recorded.
  • the method further includes: determining, from the counted positioning results, the number of positioning results with preset precision information, and using the ratio of that number to X to determine the success rate of the offline positioning data corresponding to the preset precision.
  • the method for determining offline positioning data is optimized using the determined coverage and/or success rate.
  • the determining method for optimizing offline positioning data by using the determined coverage rate and/or success rate includes:
  • the coverage rate and/or success rate of the offline positioning data obtained with each of the different clustering methods used when determining the offline positioning data is calculated, and a clustering method whose coverage rate and/or success rate meets the preset coverage requirement and/or success rate requirement is selected; or,
  • the clustering parameter values of a clustering method are adjusted according to the coverage rate and/or success rate of the offline positioning data obtained with that clustering method, until the coverage rate and/or success rate of the obtained offline positioning data meets the preset coverage requirement and/or success rate requirement.
  • the method further comprises:
  • adjusting the coverage requirement and/or the success rate requirement according to the data volume requirement of the offline positioning data, so as to select a clustering method and/or clustering parameters for which the data volume of the obtained offline positioning data meets the data volume requirement.
  • the present invention also provides an apparatus for determining the quality of offline positioning data, which is used to determine the coverage rate and/or success rate of offline positioning data by using the online positioning log and the offline positioning data of the server; the apparatus includes: a coverage determining unit and/or a success rate determining unit;
  • the coverage determining unit is configured to count the total number of records X in the set time period in the online positioning log, count the number Y of records in the set time period in the online positioning log whose LAC identification information exists in the offline positioning database, and use the ratio of Y to X to determine the coverage rate of the offline positioning data;
  • the success rate determining unit is configured to count the total number of records X in the set time period in the online positioning log, count the number Z of positioning results in the online positioning log in the set time period that are consistent with the offline positioning data, and use the ratio of Z to X to determine the success rate of the offline positioning data.
  • the LAC data includes: LAC identification information and corresponding location information.
  • the apparatus further includes: an offline data determining unit, configured to combine the base station data according to the LAC to obtain the LAC data, where the base station data includes a base station identifier, a LAC identifier of the base station, and location information of the base station.
  • the offline data determining unit is configured to cluster base station data belonging to the same LAC according to location information to obtain at least one class, and to record the identification information of the LAC and the center point location information of the corresponding class, as well as contour information or radius information.
  • when counting Z, the success rate determining unit specifically performs:
  • the comparison operation is performed for all the positioning results in the online positioning log in the set time period, and the number of positioning results finally counted as consistent is Z.
  • when counting a consistent positioning result, the success rate determining unit further records the positioning accuracy information of the position information in the offline positioning data that is consistent with the positioning result; it determines, from the counted positioning results, the number of positioning results with preset precision information, and uses the ratio of that number to X to determine the success rate of the offline positioning data corresponding to the preset precision.
  • the apparatus further includes: an optimizing unit, configured to optimize a method for determining offline positioning data by using a coverage rate determined by the coverage determining unit and/or a success rate determined by the success rate determining unit.
  • the optimization unit is specifically configured to calculate the coverage rate and/or success rate of the offline positioning data obtained with each of the different clustering methods used when determining offline positioning data, and select a clustering method whose coverage rate and/or success rate meets the preset coverage requirement and/or success rate requirement; or
  • to adjust the clustering parameter values of a clustering method according to the coverage rate and/or success rate of the offline positioning data obtained with that clustering method, until the coverage rate and/or success rate of the obtained offline positioning data meets the preset coverage requirement and/or success rate requirement.
  • the optimization unit is further configured to adjust the coverage requirement and/or the success rate requirement according to the data volume requirement of the offline positioning data, thereby selecting a clustering method and/or clustering parameters for which the data volume of the obtained offline positioning data meets the data volume requirement.
  • the present invention utilizes the online positioning log and the offline positioning data of the server to determine the coverage rate and/or success rate of the offline positioning data, and uses the coverage rate and/or success rate to reflect the quality of the offline positioning data, thereby measuring the quality of offline positioning and guiding the optimization of offline positioning data.
  • FIG. 1 is a structural diagram of a device according to an embodiment of the present invention.
  • the core idea of the present invention is to use the online positioning log and the offline positioning database of the server to determine the coverage rate and/or success rate of the offline positioning data, thereby measuring the quality of the offline positioning data. The method for determining the offline positioning data can then be adaptively adjusted according to that quality.
  • the online positioning log in the embodiment of the present invention refers to the record of positioning performed in the online positioning mode.
  • the online positioning mode mainly includes base station positioning and wifi positioning. Therefore, the online positioning log may include the positioning result corresponding to a base station ID, the positioning result corresponding to a wifi identifier, and the like, and may further include the positioning time, the LAC (Location Area Code) identification information to which the base station ID belongs, and the like.
  • the LAC uniquely identifies each location area in China's digital public land mobile network. It is a 2-byte hexadecimal BCD code.
  • the LAC is sent in system messages on the broadcast channel of each cell, and when performing online positioning the mobile device acquires both the base station ID and the LAC identification information to which the base station belongs.
  • when the server records the online positioning log, it also records the LAC identification information to which each base station ID belongs.
  • the offline positioning data is data that can be used for the offline positioning of the mobile device.
  • the offline positioning data may include LAC data
  • the LAC data may include LAC identification information and corresponding location information.
  • the offline positioning data may be obtained by combining base station data. The following describes the process of generating offline positioning data.
  • Each base station data record includes attribute information such as the base station ID, the LAC, and the location information (for example, latitude and longitude) of the base station. The location information of the base station may be obtained from the operator, or may be the result of positioning the base station (for example, the center point position obtained by clustering the positioning results that correspond to the same base station in the online positioning log).
  • Base station data in the database is merged based on the LAC to obtain all base station data belonging to the same LAC.
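  • The merging step can be sketched as a simple grouping of base station records by LAC. This is only an illustration: the `BaseStation` record type, its field names, and the `group_by_lac` helper are hypothetical and are not taken from the patent.

```python
from collections import defaultdict
from typing import NamedTuple


class BaseStation(NamedTuple):
    # Hypothetical record layout: the patent only lists these attributes.
    station_id: str  # base station ID
    lac: str         # LAC identifier the base station belongs to
    lat: float       # latitude of the base station
    lon: float       # longitude of the base station


def group_by_lac(stations):
    """Return {lac: [BaseStation, ...]} so each LAC can be clustered on its own."""
    groups = defaultdict(list)
    for station in stations:
        groups[station.lac].append(station)
    return dict(groups)
```

  • Each list in the returned dictionary holds all base station data belonging to one LAC and would be the input to the clustering step.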
  • For the merging, clustering can be used.
  • the base station data that belongs to the same LAC is clustered based on the location information.
  • In most cases the base station data belonging to the same LAC is clustered into a single class, and the location information of the center point of that class is recorded for the LAC. The location information may be latitude and longitude, or it may be administrative division information of the location, for example, xx district of xx province. Further, the contour information of the class corresponding to the LAC or the radius of its coverage area can be recorded.
  • the location information of the same LAC under different MNCs of the same carrier is merged where possible; if it cannot be merged, all of it is retained. For example, if two pieces of location information of the same LAC belong to different MNCs of the same carrier and the offset between their centers is small (for example, less than a specified distance), the two may be combined into one.
  • MNC stands for Mobile Network Code.
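  • A possible way to merge the location information of one LAC across MNCs is sketched below. The patent only says that nearby centers "may be combined into one"; the haversine distance, the 1 km default threshold, and the choice to average the two centers are all assumptions made for illustration.

```python
import math


def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def merge_mnc_centers(centers, max_offset_km=1.0):
    """Merge centers of the same LAC whose offset is small; keep the others as-is.

    centers: list of (lat, lon) tuples, one per MNC.
    Averaging the two centers is an assumed merge rule, not the patent's.
    """
    merged = []
    for center in centers:
        for i, kept in enumerate(merged):
            if haversine_km(center, kept) < max_offset_km:
                merged[i] = ((kept[0] + center[0]) / 2, (kept[1] + center[1]) / 2)
                break
        else:
            merged.append(center)  # cannot be merged, so it is retained
    return merged
```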
  • If the area covered by a LAC is too large, the base station data of the same LAC may be clustered into a plurality of classes ("multiple" refers to more than one in the embodiment of the present invention).
  • In that case, each obtained class can be checked against preset attribute standards, and a class that does not meet them is filtered out.
  • the location information of the center point of each class of the LAC that meets the attribute standards is recorded, and the contour information of such a class may be further recorded.
  • 1) the creation time of the base stations in the class is within the most recent preset time range. This guards against base stations that were created long ago: the actual base station may no longer be in use while the base station data in the database has not been updated in time. A class generated in this case can be filtered out without being recorded.
  • 2) the coverage area of the class is greater than or equal to the preset area standard. Some base stations form a small class separated from most other base stations. The probability of users being in the area covered by such a class is usually much lower than for other classes; these are often places far from the urban area where a base station was created for some special need, so classes generated in this situation can be filtered out.
  • 3) the number of base stations in the class is greater than or equal to the preset number standard. A larger number of base stations is usually set up where users are denser and demand is higher. As in case 2) above, some base stations form a small class away from most other base stations, and the probability of users being in the coverage area of such a class is usually much lower than for other classes, so a class generated in this situation can be filtered out.
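  • The attribute standards above can be sketched as a filter over clustered classes. The `ClusterClass` fields, the default thresholds, and the choice to require all three standards at once are assumptions for illustration; the patent does not fix concrete values, and using the newest creation time in the class is one possible reading of the creation-time standard.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class ClusterClass:
    # Hypothetical summary of one class produced by clustering a LAC.
    newest_station_created: datetime  # latest base station creation time in the class
    area_km2: float                   # coverage area of the class
    station_count: int                # number of base stations in the class


def meets_standards(c, now, max_age=timedelta(days=365),
                    min_area_km2=1.0, min_stations=5):
    """Check the creation-time, coverage-area, and station-count standards."""
    return (now - c.newest_station_created <= max_age
            and c.area_km2 >= min_area_km2
            and c.station_count >= min_stations)


def filter_classes(classes, now, **thresholds):
    """Keep only the classes that meet the attribute standards."""
    return [c for c in classes if meets_standards(c, now, **thresholds)]
```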
  • clustering methods such as K-means, K-medoids, and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) may be employed.
  • the present invention does not limit the clustering method.
  • the LAC data obtained in this way may include: LAC identification information and corresponding location information, where the location information corresponding to the LAC may include location information of the center of the coverage area, and may also include contour information or radius information of the coverage area.
  • the location information of the central point may be described by latitude and longitude, or may be described by administrative division information.
  • The base station data on which the LAC data is based may be obtained from the operator, or by collecting the positioning logs of various applications with positioning functions and extracting it from those logs.
  • Applications with positioning capabilities can be, but are not limited to, various maps.
  • the quality of the offline positioning data may be reflected by at least one of coverage rate and success rate.
  • To determine the coverage rate, the total number of records X in the set time period in the online positioning log is counted, and the number Y of those records whose LAC identification information exists in the offline positioning database is counted; the ratio of Y to X is then used to determine the coverage rate of the offline positioning data.
  • That is, Y is the number of the X positioning records whose LAC identification information exists in the offline positioning database.
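  • The coverage computation can be sketched as follows. The record layout (a dictionary with a `"lac"` key) and the function name are assumptions; the log is assumed to have been restricted to the set time period already.

```python
def coverage_rate(log_records, offline_lacs):
    """Return Y / X for the given online positioning log records.

    log_records: iterable of dicts with a "lac" key (assumed layout).
    offline_lacs: set of LAC identifiers present in the offline positioning database.
    """
    records = list(log_records)
    x = len(records)                                          # total records X
    if x == 0:
        return 0.0                                            # no records: define coverage as 0
    y = sum(1 for r in records if r["lac"] in offline_lacs)   # covered records Y
    return y / x
```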
  • To determine the success rate, the number Z of online positioning results in the set time period that are consistent with the offline positioning data may be counted, and the ratio of Z to X is used to determine the success rate of the offline positioning data.
  • the offline positioning result is the location information in the offline positioning database.
  • For example, each piece of positioning data in the online positioning log for one day can be compared with the offline positioning data: the LAC identification information to which the base station ID in the positioning data belongs is first determined, the location information corresponding to that LAC identification information is then looked up in the offline positioning data, and it is determined whether the base station positioning result in the piece of positioning data is consistent with that location information, for example, whether it falls within the area corresponding to the LAC identification information.
  • Suppose a piece of positioning data in the online positioning log is [base station a, LAC1, (39.98, 116.30)], where base station a is the base station ID, LAC1 is the LAC identifier to which base station a belongs, and (39.98, 116.30) is the latitude and longitude.
  • LAC1 is used to query the offline positioning data, and the corresponding location information is assumed to be Haidian District, Beijing. It is then judged whether (39.98, 116.30) falls within Haidian District, Beijing; the result is yes, indicating that this piece of positioning data is consistent with the offline positioning data. All the positioning data in the online positioning log for the day is compared in the same way, and the number of consistent results counted is Z.
  • “one day” is an example of a preset time period.
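  • The comparison described above can be sketched with a radius-based consistency check. The example in the text uses an administrative district; here the offline data is instead assumed to store a center point and a radius per LAC, which the patent also allows. The record layout and function names are hypothetical.

```python
import math


def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def success_rate(log_records, offline_data):
    """Return Z / X.

    log_records: iterable of dicts {"lac": str, "position": (lat, lon)} (assumed layout).
    offline_data: {lac: ((center_lat, center_lon), radius_km)} (assumed layout).
    """
    records = list(log_records)
    x = len(records)
    if x == 0:
        return 0.0
    z = 0
    for r in records:
        entry = offline_data.get(r["lac"])
        if entry is None:
            continue                      # LAC not covered, so it cannot be consistent
        center, radius_km = entry
        if haversine_km(r["position"], center) <= radius_km:
            z += 1                        # online result falls inside the LAC area
    return z / x
```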
  • the accuracy may be reflected by the area radius or by administrative division information (for example, city level, district level, etc.). Therefore, the success rate can be calculated separately for different precisions. For example, when positioning data in the online positioning log is compared with the location information corresponding to the LAC identification information in the offline positioning data, the offline positioning accuracy corresponding to each consistent result is recorded in addition to counting the consistent results. The number of consistent results corresponding to a preset precision is then determined, and the ratio of that number to X gives the success rate of the offline positioning data for that precision.
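  • A per-precision success rate could be computed along these lines, assuming that a precision label (such as "city" or "district") was recorded for every consistent result. The labels and the function name are illustrative only.

```python
from collections import Counter


def success_rate_by_precision(consistent_precisions, total_records):
    """Return {precision_label: count / X} for the recorded consistent results.

    consistent_precisions: one precision label per consistent positioning result.
    total_records: the total number of log records X in the set time period.
    """
    if total_records == 0:
        return {}
    counts = Counter(consistent_precisions)
    return {level: n / total_records for level, n in counts.items()}
```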
  • the method for determining the offline positioning data can be adaptively optimized.
  • the optimization of the method for determining the offline positioning data mainly includes the optimization of the clustering method and the optimization of the clustering parameters.
  • the offline positioning data is obtained by using the base station data, and specifically, the base station data belonging to the same LAC is clustered based on the location information.
  • For clustering, a variety of clustering methods can be used, such as K-means, K-medoids, and DBSCAN.
  • the coverage rate and/or success rate of the offline positioning data obtained with each clustering method may be calculated, and a clustering method whose coverage rate and/or success rate meets the preset coverage requirement and/or success rate requirement may be selected.
  • the offline positioning data obtained by the selected clustering method can then be configured on the mobile device for use in offline positioning.
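  • Selecting among clustering methods can be sketched as a filter over measured metrics. The function name and input layout are assumptions, and both requirements are checked together here although the text allows either one alone.

```python
def select_clustering_methods(metrics, min_coverage, min_success):
    """Return the names of methods whose metrics meet both preset requirements.

    metrics: {method_name: (coverage_rate, success_rate)} measured per method.
    """
    return [name for name, (coverage, success) in metrics.items()
            if coverage >= min_coverage and success >= min_success]
```

  • If several methods qualify, a further criterion (for example, the data volume discussed later) would be needed to choose between them.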
  • when the base station data belonging to the same LAC is clustered based on the location information to obtain the offline positioning data, the clustering algorithm involves some clustering parameters. For example, each round of iteration in the k-means algorithm computes the distance between the new cluster center and the cluster center obtained in the previous iteration; when that distance is less than or equal to a preset distance threshold, the iteration ends. The distance threshold is then a clustering parameter.
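  • The role of the distance threshold can be illustrated with a minimal k-means sketch. It assumes planar Euclidean distance on coordinate pairs, takes the first k points as initial centers, and stops when the largest center shift is at most the threshold; none of these choices comes from the patent.

```python
import math


def kmeans(points, k, distance_threshold=1e-4, max_iter=100):
    """Cluster 2-D points; stop when no center moves more than distance_threshold."""
    centers = list(points[:k])  # deterministic initialisation (an assumption)
    clusters = [[] for _ in range(k)]
    for _ in range(max_iter):
        # Assignment step: attach every point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its members.
        new_centers = [
            tuple(sum(coord) / len(members) for coord in zip(*members))
            if members else centers[i]
            for i, members in enumerate(clusters)
        ]
        shift = max(math.dist(old, new) for old, new in zip(centers, new_centers))
        centers = new_centers
        if shift <= distance_threshold:  # the clustering parameter discussed above
            break
    return centers, clusters
```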
  • the parameters involved in the DBSCAN clustering method are mainly the scan radius eps and the minimum number of contained points minPts.
  • the approximate clustering process is: select an unvisited point and find all nearby points within eps of it.
  • If the number of nearby points is greater than or equal to minPts, the current point forms a class with its nearby points and is marked as visited; then, recursively, all points in the class that are not yet marked as visited are processed in the same way, thereby extending the class. If the number of nearby points is less than minPts, the point is temporarily marked as a noise point.
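  • The process above can be sketched as a small DBSCAN implementation. It assumes planar Euclidean distance, counts a point as its own neighbor, and uses an explicit work list instead of recursion; it is an illustrative sketch rather than the patent's implementation.

```python
import math

NOISE = -1


def dbscan(points, eps, min_pts):
    """Return one label per point: a class index (0, 1, ...) or NOISE."""
    labels = [None] * len(points)  # None means "not visited yet"

    def neighbors(i):
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    class_id = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        nearby = neighbors(i)
        if len(nearby) < min_pts:
            labels[i] = NOISE          # temporary: may later join a class as a border point
            continue
        labels[i] = class_id
        pending = [j for j in nearby if j != i]
        while pending:                 # extend the class from its unvisited members
            j = pending.pop()
            if labels[j] == NOISE:
                labels[j] = class_id   # former noise point becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = class_id
            j_nearby = neighbors(j)
            if len(j_nearby) >= min_pts:
                pending.extend(j_nearby)
        class_id += 1
    return labels
```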
  • After the coverage rate and/or success rate of the obtained offline positioning data is determined, the clustering parameter values in the clustering method may be adjusted according to the obtained coverage rate and/or success rate until the coverage rate and/or success rate of the obtained offline positioning data meets the preset coverage requirement and/or success rate requirement. The adjusted clustering parameter values are then used for clustering to obtain offline positioning data, and that offline positioning data is configured on the mobile device for offline positioning.
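  • The adjustment loop can be sketched as a search over candidate parameter values. The patent does not say how values are adjusted, so trying a fixed list of candidates in order is an assumption, and `evaluate` is a hypothetical callback that re-clusters with the given value and measures the resulting coverage rate and success rate.

```python
def tune_parameter(candidates, evaluate, min_coverage, min_success):
    """Return (value, coverage, success) for the first candidate meeting both
    requirements, or None if no candidate does.

    evaluate(value) -> (coverage_rate, success_rate) for data built with that value.
    """
    for value in candidates:
        coverage, success = evaluate(value)
        if coverage >= min_coverage and success >= min_success:
            return value, coverage, success
    return None
```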
  • the clustering method and the clustering parameter may be respectively optimized for different positioning precisions, thereby obtaining offline positioning data for different positioning precisions.
  • For example, the success rate and coverage rate of the offline positioning data can be calculated for city-level precision, the clustering method selected and the clustering parameters adjusted by the clustering-method selection or clustering-parameter adjustment described above, and the selected clustering method and adjusted clustering parameters used to obtain city-level offline positioning data. The success rate and coverage rate of the offline positioning data are then calculated for district-and-county-level precision, the clustering method selected and the clustering parameters adjusted in the same way, and the selected clustering method and adjusted clustering parameters used to obtain district-and-county-level offline positioning data.
  • the city-level offline positioning data and the district-and-county-level offline positioning data can both be configured on the mobile device, so that when an application on the mobile device needs city-level positioning precision, the city-level offline positioning data is used for offline positioning, and when an application on the mobile device needs district-and-county-level positioning precision, the district-and-county-level offline positioning data is used for offline positioning.
  • the preset requirements for the coverage rate and/or success rate can be adjusted according to the data volume requirement of the offline positioning data, so that a clustering method and clustering parameters are selected for which the data volume of the obtained offline positioning data meets that requirement. Since the storage capacity of mobile devices is limited, there is a certain requirement on the data volume of offline positioning data. If the data volume is too large, the mobile device may be unable to store it, or its performance may be greatly affected. Generally, the higher the coverage rate, the larger the data volume of the offline positioning data, and the higher the success rate, the larger the data volume.
  • the requirements for the coverage rate and/or success rate of the offline positioning database can therefore be adjusted, and the clustering method selected and the clustering parameters adjusted, so that the data volume requirement of the offline positioning data is met while the coverage rate and/or success rate requirements are satisfied.
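  • One way to reconcile quality with the data volume limit is sketched below: among candidate configurations, keep those whose data fits the device budget and take the one with the best coverage rate, then success rate. The dictionary layout, the entry-count measure of data volume, and this ranking rule are assumptions; the patent only states that the requirements are adjusted to fit the data volume.

```python
def best_within_budget(options, max_entries):
    """Pick the configuration with the highest (coverage, success) that fits.

    options: list of dicts with "coverage", "success", and "entries" keys (assumed layout).
    max_entries: the largest number of offline data entries the device can hold.
    Returns None when no configuration fits the budget.
    """
    feasible = [o for o in options if o["entries"] <= max_entries]
    if not feasible:
        return None
    return max(feasible, key=lambda o: (o["coverage"], o["success"]))
```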
  • FIG. 1 is a structural diagram of a device according to an embodiment of the present invention.
  • the device is configured to determine the coverage and/or success rate of offline positioning data by using online positioning logs and offline positioning data of a server, thereby measuring the quality of offline positioning data.
  • the device specifically includes: a coverage rate determining unit 01 and/or a success rate determining unit 02.
  • the example in FIG. 1 includes both units, and may further include an offline data determining unit 03 and an optimizing unit 04.
  • the coverage rate determining unit 01 is responsible for counting the total number of records X in the set time period in the online positioning log, counting the number Y of those records whose LAC identification information exists in the offline positioning database, and using the ratio of Y to X to determine the coverage rate of the offline positioning data.
  • the success rate determining unit 02 is responsible for counting the total number of records X in the set time period in the online positioning log, counting the number Z of positioning results in the online positioning log in the set time period that are consistent with the offline positioning data, and using the ratio of Z to X to determine the success rate of the offline positioning data.
  • For example, each piece of positioning data in the online positioning log for one day can be compared with the offline positioning data: the LAC identification information to which the base station ID in the positioning data belongs is first determined, the location information corresponding to that LAC identification information is then looked up in the offline positioning data, and it is determined whether the base station positioning result in the piece of positioning data is consistent with that location information, for example, whether it falls within the area corresponding to the LAC identification information.
  • a similar comparison is performed on all the positioning data in the online positioning log for the day, and the number of consistent results obtained is Z.
  • the success rate determining unit 02 can further record, when counting a consistent positioning result, the positioning accuracy information of the position information in the offline positioning data that is consistent with the positioning result; by determining from the counted positioning results the number of positioning results with preset precision information and using the ratio of that number to X, the success rate of the offline positioning data corresponding to the preset precision can be determined.
  • the LAC data in the offline positioning database includes: LAC identification information and corresponding location information.
  • the LAC data can be obtained by the offline data determining unit 03. Specifically, the offline data determining unit 03 combines the base station data according to the LAC to obtain LAC data, where the base station data includes the base station identifier, the LAC identifier of the base station, and the location information of the base station.
  • the offline data determining unit 03 may cluster the base station data belonging to the same LAC based on the location information to obtain at least one class; record the identification information of the LAC and the center point location information of the corresponding class, and the contour information or the radius information.
  • the determining method of the offline positioning data may be adaptively optimized, and the optimization processing is performed by the optimizing unit 04.
  • the optimization process may include optimization of a clustering method and optimization of clustering parameters.
  • the optimization unit 04 may calculate the coverage rate and/or success rate of the offline positioning data obtained with each of the different clustering methods used when determining the offline positioning data, and select a clustering method whose coverage rate and/or success rate meets the preset coverage requirement and/or success rate requirement.
  • the optimization unit 04 may adjust the clustering parameter values of a clustering method according to the coverage rate and/or success rate of the offline positioning data obtained with that clustering method, until the coverage rate and/or success rate of the obtained offline positioning data meets the preset coverage requirement and/or success rate requirement.
  • the optimization unit 04 may adjust the coverage requirement and/or the success rate requirement according to the data volume requirement of the offline positioning data, thereby selecting a clustering method and/or clustering parameters for which the data volume of the obtained offline positioning data meets the data volume requirement. Since the storage capacity of mobile devices is limited, there is a certain requirement on the data volume of offline positioning data. If the data volume is too large, the mobile device may be unable to store it, or its performance may be greatly affected. Generally, the higher the coverage rate, the larger the data volume of the offline positioning data, and the higher the success rate, the larger the data volume.
  • the coverage rate and/or success rate requirements of the offline positioning database can therefore be adjusted, and the clustering method selected and the clustering parameters adjusted, so that the data volume requirement of the offline positioning data is met while the coverage rate and/or success rate requirements are satisfied.
  • the device provided by the invention may be arranged on the server side or as a separate device.
  • the method and apparatus provided by the present invention can have the following advantages:
  • the present invention uses the online positioning log and the offline positioning data on the server side to determine the coverage and/or success rate of the offline positioning data, and uses the coverage and/or success rate to reflect the quality of the offline positioning data, thereby measuring the quality of offline positioning and guiding the optimization of the offline positioning data.
  • the present invention can use the coverage and/or success rate of the offline positioning data to optimize the method for determining the offline positioning data; that is, without relying on feedback from the mobile device, the data can be optimized in a closed loop on the server side to improve the quality of offline positioning.
  • the disclosed apparatus and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division, and the actual implementation may have another division manner.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the above-described integrated unit implemented in the form of a software functional unit can be stored in a computer readable storage medium.
  • the above software functional unit is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, etc.) or a processor to perform some of the steps of the methods described in the embodiments of the present invention.
  • the foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

Abstract

The present invention provides a method and device for determining the quality of offline positioning data. The method comprises: determining the coverage rate and/or success rate of offline positioning data according to an online positioning log on the side of a server and the offline positioning data, wherein the determining the coverage rate comprises: computing the total number X of records in the online positioning log within a set time period, computing the number Y of records, existing in an offline positioning database, of LAC identification information in the online positioning log within the set time period, and determining the coverage rate of the offline positioning data according to the ratio of Y to X; and the determining the success rate comprises: computing the total number X of records in the online positioning log within a set time period, computing the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data, and determining the success rate of the offline positioning data according to the ratio of Z to X. By means of the present invention, the quality of offline positioning is measured according to the coverage rate and/or success rate of offline positioning data, so as to give guidance for optimization of the offline positioning data.

Description

Method and device for determining the quality of offline positioning data
[Technical Field]
The present invention relates to the field of computer application technologies, and in particular to a method and apparatus for determining the quality of offline positioning data.
[Background Art]
With the continuing spread and development of mobile devices, users increasingly rely on them for communication, information access, entertainment, and even work. While using mobile device services, users often need the device to be positioned. The common ways of positioning a mobile device are currently GPS (Global Positioning System) positioning and base station positioning. GPS positioning requires the mobile terminal to have a GPS module, and often fails when the device's position is obstructed. In base station positioning, the mobile device sends a base station ID (identifier) to the server in a network request; the server determines the location of the mobile device from the latitude and longitude of the base station stored in its database and returns that location to the mobile device over the network. Base station positioning requires the mobile device to access the network for every fix; it is an online positioning (also called network positioning) method, which consumes network traffic and cannot work without a network connection.
Given these limitations of GPS positioning and base station positioning, offline positioning has been proposed: where high positioning accuracy is not required, the mobile device can position itself using locally stored offline positioning data. Offline positioning data is the prerequisite for offline positioning, and its quality directly affects the quality of offline positioning. The prior art provides no method for determining the quality of offline positioning data, so the quality of offline positioning cannot be measured and the optimization of offline positioning data cannot be guided.
[Summary of the Invention]
In view of this, the present invention provides a method and apparatus for determining the quality of offline positioning data, so that the quality of offline positioning can be measured and the optimization of offline positioning data can be guided.
The specific technical solutions are as follows:
The present invention provides a method for determining the quality of offline positioning data, the method comprising:
determining the coverage and/or success rate of the offline positioning data by using the online positioning log and the offline positioning data on the server side;
wherein determining the coverage of the offline positioning data includes: counting the total number of records X in the online positioning log within a set time period, counting the number Y of records, existing in the offline positioning database, of the LAC identification information found in the online positioning log within the set time period, and determining the coverage of the offline positioning data from the ratio of Y to X;
and determining the success rate of the offline positioning data includes: counting the total number of records X in the online positioning log within the set time period, counting the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data, and determining the success rate of the offline positioning data from the ratio of Z to X.
According to a preferred embodiment of the present invention, the LAC data includes LAC identification information and its corresponding location information.
According to a preferred embodiment of the present invention, the LAC data is obtained by merging base station data according to LAC, and the base station data includes a base station identifier, the identifier of the LAC to which the base station belongs, and the location information of the base station.
According to a preferred embodiment of the present invention, merging the base station data according to LAC includes:
clustering the base station data belonging to the same LAC based on location information to obtain at least one class;
recording the identification information of the LAC and the center point location information of its corresponding class, together with contour information or radius information.
According to a preferred embodiment of the present invention, counting the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data includes:
comparing whether the positioning result corresponding to a base station ID in the online positioning log is consistent with the location information that corresponds, in the offline positioning data, to the LAC identification information to which that base station ID belongs, and if so, counting that positioning result;
performing this comparison for all positioning results in the online positioning log within the set time period, the final number of counted positioning results being Z.
According to a preferred embodiment of the present invention, when a positioning result is counted, the positioning accuracy information of the location information in the offline positioning data that is consistent with that positioning result is further recorded;
the method further includes: determining, among the counted positioning results, the number of positioning results with a preset accuracy, and using the ratio of that number to X to determine the success rate of the offline positioning data for the preset accuracy.
According to a preferred embodiment of the present invention, the determined coverage and/or success rate is used to optimize the method for determining the offline positioning data.
According to a preferred embodiment of the present invention, using the determined coverage and/or success rate to optimize the method for determining the offline positioning data includes:
calculating the coverage and/or correct rate of the offline positioning data obtained with each of the different clustering methods used when determining the offline positioning data, and selecting the clustering method whose coverage and/or correct rate meets the preset coverage requirement and/or correct rate requirement; or,
adjusting the clustering parameter values of a given clustering method according to the coverage and/or success rate of the offline positioning data obtained with that method, until the coverage and/or success rate of the obtained offline positioning data meets the preset coverage requirement and/or success rate requirement.
According to a preferred embodiment of the present invention, the method further includes:
adjusting the coverage requirement and/or success rate requirement according to the data volume requirement for the offline positioning data, thereby selecting a clustering method and/or clustering parameters for which the data volume of the obtained offline positioning data satisfies the data volume requirement.
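The selection described above can be illustrated with a minimal sketch. It assumes that each clustering method or parameter setting has already been evaluated into a coverage, a success rate, and a data size; the `Candidate` type, the `pick_candidate` function, and the fixed relaxation step are hypothetical and are not taken from the invention, which does not specify how the requirements are adjusted.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    name: str            # hypothetical label for a clustering method / parameter setting
    coverage: float      # measured coverage of the resulting offline data
    success_rate: float  # measured success rate of the resulting offline data
    size_bytes: int      # data volume of the resulting offline data

def pick_candidate(candidates, min_coverage, min_success, max_size_bytes, relax_step=0.05):
    """Relax the coverage/success requirements step by step (an assumed policy)
    until some candidate also fits the data volume budget."""
    cov, suc = min_coverage, min_success
    while True:
        ok = [c for c in candidates
              if c.coverage >= cov and c.success_rate >= suc
              and c.size_bytes <= max_size_bytes]
        if ok:
            # Prefer the best success rate, then the best coverage.
            return max(ok, key=lambda c: (c.success_rate, c.coverage)), cov, suc
        if cov <= 0.0 and suc <= 0.0:
            return None, cov, suc  # nothing fits the size budget at all
        cov = max(0.0, cov - relax_step)
        suc = max(0.0, suc - relax_step)
```

With a tight size budget, a smaller candidate with slightly lower coverage can be chosen once the requirements have been relaxed; if no candidate fits the budget at all, the function returns `None` as the chosen candidate.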
The present invention also provides an apparatus for determining the quality of offline positioning data, which uses the online positioning log and the offline positioning data on the server side to determine the coverage and/or success rate of the offline positioning data; the apparatus includes a coverage determining unit and/or a success rate determining unit;
the coverage determining unit is configured to count the total number of records X in the online positioning log within a set time period, count the number Y of records, existing in the offline positioning database, of the LAC identification information found in the online positioning log within the set time period, and determine the coverage of the offline positioning data from the ratio of Y to X;
the success rate determining unit is configured to count the total number of records X in the online positioning log within the set time period, count the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data, and determine the success rate of the offline positioning data from the ratio of Z to X.
According to a preferred embodiment of the present invention, the LAC data includes LAC identification information and its corresponding location information.
According to a preferred embodiment of the present invention, the apparatus further includes an offline data determining unit, configured to merge base station data according to LAC to obtain the LAC data, where the base station data includes a base station identifier, the identifier of the LAC to which the base station belongs, and the location information of the base station.
According to a preferred embodiment of the present invention, the offline data determining unit is specifically configured to cluster the base station data belonging to the same LAC based on location information to obtain at least one class, and to record the identification information of the LAC and the center point location information of its corresponding class, together with contour information or radius information.
According to a preferred embodiment of the present invention, when counting Z, the success rate determining unit specifically performs the following:
comparing whether the positioning result corresponding to a base station ID in the online positioning log is consistent with the location information that corresponds, in the offline positioning data, to the LAC identification information to which that base station ID belongs, and if so, counting that positioning result;
performing this comparison for all positioning results in the online positioning log within the set time period, the final number of counted positioning results being Z.
According to a preferred embodiment of the present invention, when counting a positioning result, the success rate determining unit further records the positioning accuracy information of the location information in the offline positioning data that is consistent with that positioning result; it determines, among the counted positioning results, the number of positioning results with a preset accuracy, and uses the ratio of that number to X to determine the success rate of the offline positioning data for the preset accuracy.
According to a preferred embodiment of the present invention, the apparatus further includes an optimizing unit, configured to optimize the method for determining the offline positioning data by using the coverage determined by the coverage determining unit and/or the success rate determined by the success rate determining unit.
According to a preferred embodiment of the present invention, the optimizing unit is specifically configured to calculate the coverage and/or correct rate of the offline positioning data obtained with each of the different clustering methods used when determining the offline positioning data, and to select the clustering method whose coverage and/or correct rate meets the preset coverage requirement and/or correct rate requirement; or,
to adjust the clustering parameter values of a given clustering method according to the coverage and/or success rate of the offline positioning data obtained with that method, until the coverage and/or success rate of the obtained offline positioning data meets the preset coverage requirement and/or success rate requirement.
According to a preferred embodiment of the present invention, the optimizing unit is further configured to adjust the coverage requirement and/or success rate requirement according to the data volume requirement for the offline positioning data, thereby selecting a clustering method and/or clustering parameters for which the data volume of the obtained offline positioning data satisfies the data volume requirement.
As can be seen from the above technical solutions, the present invention uses the online positioning log and the offline positioning data on the server side to determine the coverage and/or success rate of the offline positioning data, and uses the coverage and/or success rate to reflect the quality of the offline positioning data, thereby measuring the quality of offline positioning and guiding the optimization of the offline positioning data.
[Description of the Drawings]
FIG. 1 is a structural diagram of the device provided by an embodiment of the present invention.
[Detailed Description]
To make the objectives, technical solutions, and advantages of the present invention clearer, the present invention is described in detail below with reference to the drawings and specific embodiments.
The core idea of the present invention is to use the online positioning log and the offline positioning database on the server side to determine the coverage and/or success rate of the offline positioning data, and thereby measure the quality of the offline positioning data. The method for determining the offline positioning data can further be adjusted adaptively according to the quality of the offline positioning data.
The online positioning log in the embodiments of the present invention is a record of positioning performed by online positioning methods, which mainly include base station positioning and Wi-Fi positioning. The online positioning log may therefore include the positioning result corresponding to a base station ID, the positioning result corresponding to a Wi-Fi identifier, and so on; it may also include the positioning time, the LAC (Location Area Code) identification information to which the base station ID belongs, and the like. The LAC uniquely identifies each location area in China's digital public land mobile network and is a 2-byte hexadecimal BCD code. The LAC is sent in system messages on the broadcast channel of each cell, so a mobile device performing online positioning obtains the LAC identification information of a base station together with the base station ID. When the server records the online positioning log, it also records the LAC identification information to which each base station ID belongs.
Offline positioning data is data that a mobile device can use for offline positioning. In the embodiments of the present invention, the offline positioning data may include LAC data, and the LAC data may include LAC identification information and its corresponding location information. The offline positioning data may be obtained by merging base station data. The process of generating the offline positioning data is described below.
Each item of base station data contains attribute information such as the base station ID, the LAC to which the base station belongs, and the location information of the base station (for example, latitude and longitude). The location information of the base station may be obtained from the operator, or it may be the center point obtained by clustering the positioning results that correspond to the same base station among the base station positioning results (which can be taken from the online positioning log).
The base station data in the database is merged by LAC to obtain all base station data belonging to the same LAC. The merging may be done by clustering: the base station data belonging to the same LAC is clustered based on location information. In most cases, the base station data belonging to the same LAC is clustered into a single class, and the location information of the center point of that class is recorded for the LAC. The location information may be latitude and longitude, or administrative division information, for example xx District, xx City, xx Province. The contour information of the class corresponding to the LAC, or the radius of the covered area, may also be recorded.
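As a hedged illustration of recording a center point and radius for one class, the sketch below takes the base station coordinates of a single class of one LAC, uses their arithmetic mean as the center point, and uses the largest great-circle distance from the center as the radius. The function names and the choice of mean and maximum distance are assumptions; the text does not prescribe how the center or radius is computed, and the plain mean is only reasonable for small areas away from the poles and the antimeridian.

```python
import math

EARTH_RADIUS_KM = 6371.0

def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))

def summarize_class(stations):
    """stations: list of (lat, lon) base station positions in one class of one LAC.
    Returns (center, radius_km)."""
    center = (sum(p[0] for p in stations) / len(stations),
              sum(p[1] for p in stations) / len(stations))
    radius_km = max(haversine_km(center, p) for p in stations)
    return center, radius_km
```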
Observation shows that the location information for entries with the same operator, different MNCs (Mobile Network Codes), and the same LAC usually overlaps; if the overlap is large, the entries can be merged to reduce the amount of location information. On this basis, the location information of different MNCs with the same LAC under the same operator is merged wherever possible, and entries that cannot be merged are all retained. For example, if two items of location information for the same LAC belong to different MNCs of the same operator, then when the offset between their centers is small (for example, less than a specified distance), the two items can be merged into one; when the offset between their centers is large (greater than the specified distance), the two items are considered unmergeable and are kept as separate items of location information. Merging reduces the data volume and saves storage resources.
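The merge rule above can be sketched as follows, under stated assumptions: the entries all belong to one operator, each entry is a dictionary with the hypothetical keys `mnc`, `lac`, `center`, and `radius_km`, and a merged entry keeps the first center while growing its radius to cover the absorbed entry. None of these details are fixed by the text.

```python
import math

def offset_km(a, b):
    """Equirectangular approximation of the distance in km between two
    (lat, lon) points; adequate for offsets of a few kilometres."""
    mean_lat = math.radians((a[0] + b[0]) / 2)
    dx = math.radians(b[1] - a[1]) * math.cos(mean_lat)
    dy = math.radians(b[0] - a[0])
    return 6371.0 * math.hypot(dx, dy)

def merge_same_lac(entries, max_offset_km):
    """Greedily merge entries that share a LAC and whose centers are within
    max_offset_km of each other; all other entries are kept separately."""
    merged = []
    for e in entries:
        for m in merged:
            d = offset_km(m["center"], e["center"])
            if m["lac"] == e["lac"] and d <= max_offset_km:
                m["mncs"].add(e["mnc"])
                # Grow the radius so the merged entry still covers both areas.
                m["radius_km"] = max(m["radius_km"], e["radius_km"] + d)
                break
        else:
            merged.append({"lac": e["lac"], "mncs": {e["mnc"]},
                           "center": e["center"], "radius_km": e["radius_km"]})
    return merged
```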
In some cases, for example when the area covered by a LAC is too large, the base station data of the same LAC may be clustered into multiple classes ("multiple" in the embodiments of the present invention means more than one). Each resulting class can then be checked against preset attribute standards, and any class that does not meet them is filtered out. The location information of the center point of each class of the LAC that meets the attribute standards is recorded, and the contour information of those classes may also be recorded.
The attribute standards may be, but are not limited to, one or any combination of the following:
1) The creation time of the base stations in the class is within a recent preset time range. This guards against base stations that were created long ago and may no longer be in use while the base station data in the database has not been updated in time. Classes of this kind can be filtered out and not recorded.
2) The coverage area of the class is greater than or equal to a preset area standard. Some base stations lie apart from most other base stations and form a small class. The probability that a user is in the area covered by such a class is usually much lower than for other classes; these are often base stations built far from urban areas to meet some special need. Classes of this kind can be filtered out.
3) The number of base stations in the class is greater than or equal to a preset number standard. Places with denser users and higher demand usually have more base stations. As in case 2), some base stations lie apart from most other base stations and form a small class, and the probability that a user is in the area covered by such a class is usually much lower than for other classes. Classes of this kind can be filtered out.
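The three attribute standards can be expressed as a simple predicate. This is a minimal sketch: the `ClusterInfo` fields, the default thresholds, and the decision to apply all three standards together are assumptions (the text allows any one or any combination), and "creation time within a recent range" is read here as the newest base station in the class being recent enough.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

@dataclass
class ClusterInfo:
    lac: str
    newest_station_created: datetime  # creation time of the newest base station in the class
    area_km2: float                   # coverage area of the class
    station_count: int                # number of base stations in the class

def meets_attribute_standards(c, now, max_age=timedelta(days=365),
                              min_area_km2=1.0, min_stations=5):
    """True if the class passes all three (assumed) attribute standards."""
    return (now - c.newest_station_created <= max_age   # standard 1: recent creation
            and c.area_km2 >= min_area_km2              # standard 2: large enough area
            and c.station_count >= min_stations)        # standard 3: enough base stations

def filter_classes(classes, now):
    """Keep only the classes that meet the attribute standards."""
    return [c for c in classes if meets_attribute_standards(c, now)]
```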
When clustering the base station data based on location information, clustering methods such as K-means, K-medoids, and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) may be used; the present invention does not limit the clustering method.
The resulting LAC data may include LAC identification information and its corresponding location information, where the location information corresponding to a LAC may include the center point location information of the covered area and may also include the contour information or radius information of the covered area. The center point location information may be described by latitude and longitude or by administrative division information.
The base station data on which the LAC data is based may be obtained from the operator, or may be obtained from positioning logs collected from various applications with positioning functions. Applications with positioning functions may be, but are not limited to, map applications.
In the embodiments of the present invention, the quality of the offline positioning data may be reflected by at least one of the coverage and the success rate.
When determining the coverage of the offline positioning data, first count the total number of records X in the online positioning log within the set time period, and count the number Y of records, existing in the offline positioning database, of the LAC identification information found in the online positioning log within that period; then determine the coverage of the offline positioning data from the ratio of Y to X. For example, counting the records in the online log for one day gives X positioning records; the total number of records in one day's online positioning log is usually on the order of hundreds of millions. The LAC identification information in these X positioning records has Y records in the offline positioning database. For example, if LAC1, LAC2, LAC3, and LAC4 appear in the X positioning records (each LAC may appear in multiple positioning records), and LAC1, LAC2, and LAC3 have records in the offline positioning database, then Y is 3. The value of $Y/X$ can then be used as the coverage of the offline positioning data.
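A minimal sketch of the coverage calculation follows. It assumes each log record is a dictionary with a hypothetical `lac` key and that Y counts the log records whose LAC is present in the offline positioning database. The worked example in the text instead counts distinct LAC identifiers, so the counting rule used here should be treated as one possible reading rather than the method of the invention.

```python
def coverage(log_records, offline_lacs):
    """log_records: online positioning log entries for the set time period,
    each a dict with a "lac" key (assumed layout).
    offline_lacs: set of LAC identifiers present in the offline database."""
    x = len(log_records)  # total number of records X
    if x == 0:
        return 0.0
    # Y: records whose LAC identifier exists in the offline positioning database.
    y = sum(1 for r in log_records if r["lac"] in offline_lacs)
    return y / x
```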
When determining the success rate of the offline positioning data, the number Z of online positioning results within the set time period that are consistent with the offline positioning data can be counted, and the success rate determined from the ratio of Z to X. Here the offline positioning result is the location information in the offline positioning database.
Specifically, each item of positioning data in one day's online positioning log can be compared with the offline positioning data: first determine the LAC identification information to which the base station ID in the item belongs, then determine the location information corresponding to that LAC identification information in the offline positioning data, and judge whether the base station positioning data in the item is consistent with that location information, for example whether it falls within the area corresponding to the LAC identification information. As an example, suppose one item of positioning data in the online positioning log is [base station a, LAC1, (39.98, 116.30)], where base station a is the base station ID, LAC1 is the identifier of the LAC to which base station a belongs, and (39.98, 116.30) is the latitude and longitude. LAC1 is used to query the offline positioning data; suppose the corresponding location information is Haidian District, Beijing. Judging whether (39.98, 116.30) falls within Haidian District, Beijing gives the result yes, so this item of positioning data is consistent with the offline positioning data. The same comparison is performed for all positioning data in one day's online positioning log, and the number of consistent results is counted; that number is Z. Here "one day" is an example of the preset time period.
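The comparison above can be sketched as follows. To keep the example self-contained, the area corresponding to a LAC is represented by a latitude/longitude bounding box, which is a simplifying assumption standing in for a contour, a radius, or an administrative boundary; the record keys `lac` and `position` and the box coordinates in the usage note are hypothetical and are not real district boundaries.

```python
def is_consistent(position, box):
    """position: (lat, lon); box: (min_lat, min_lon, max_lat, max_lon)."""
    lat, lon = position
    min_lat, min_lon, max_lat, max_lon = box
    return min_lat <= lat <= max_lat and min_lon <= lon <= max_lon

def success_rate(log_records, offline_db):
    """log_records: dicts with "lac" and "position" keys (assumed layout).
    offline_db: {lac: bounding box} standing in for the offline positioning data."""
    x = len(log_records)  # total number of records X
    z = 0                 # consistent positioning results Z
    for r in log_records:
        box = offline_db.get(r["lac"])
        if box is not None and is_consistent(r["position"], box):
            z += 1
    return z / x if x else 0.0
```

For instance, with `offline_db = {"LAC1": (39.88, 116.05, 40.15, 116.40)}`, a record at (39.98, 116.30) under LAC1 is counted toward Z, while a record whose LAC is missing from the database or whose position lies outside the box is not.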
In addition, the location information corresponding to each piece of LAC identification information in the offline positioning database has a particular precision, namely the offline positioning precision, which may be reflected by an area radius or by administrative division information (for example city level or district/county level). The success rate can therefore be computed separately for different precisions. For example, when the base station positioning data in the online positioning log is found to be consistent with the location information corresponding to the LAC identification information in the offline positioning data, the offline positioning precision corresponding to the consistent result is recorded in addition to counting the consistent result. The number of consistent results corresponding to a preset precision is then counted, and the ratio of that number to X is taken as the success rate of the offline positioning data for that preset precision.
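A per-precision success rate can be tallied as in the following minimal sketch. It assumes each consistent result has already been tagged with a precision label; the labels "city" and "district", the function name, and the counts are illustrative assumptions only.

```python
from collections import Counter

def success_rate_by_precision(matched_precisions, total_records):
    """matched_precisions: the precision label recorded for each consistent result.
    Returns {precision: count / X}, where X is total_records."""
    counts = Counter(matched_precisions)
    return {level: n / total_records for level, n in counts.items()}

# Hypothetical tally: X = 10 log records, 4 consistent at city level, 3 at district level.
rates = success_rate_by_precision(["city"] * 4 + ["district"] * 3, 10)
```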
After the coverage rate and the success rate of the offline positioning data have been determined, the method for determining the offline positioning data can be adaptively optimized. This optimization mainly includes optimization of the clustering method and optimization of the clustering parameters.
1) Optimizing the clustering method used to determine the offline positioning data.
As can be seen from the previously described process of determining the offline positioning data, the offline positioning data is obtained from base station data; specifically, it is obtained by clustering the base station data belonging to the same LAC based on location information. Various clustering methods may be used, such as K-means, K-medoids, and DBSCAN.
In the embodiment of the present invention, the coverage rate and/or accuracy rate of the offline positioning data obtained with each clustering method may be calculated separately, and a clustering method whose coverage rate and/or accuracy rate meets the preset coverage requirement and/or accuracy requirement is selected. The offline positioning data obtained with the selected clustering method can then be deployed on the mobile device for use in offline positioning.
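The selection step might look like the sketch below. It assumes the coverage rate and accuracy rate of each method have already been measured, and it requires both thresholds to be met, which is only one reading of the "and/or" in the text; the function name, thresholds, and metric values are hypothetical.

```python
def select_clustering_methods(metrics, min_coverage, min_accuracy):
    """metrics: {method_name: (coverage_rate, accuracy_rate)}.
    Returns the methods meeting both thresholds, highest accuracy rate first."""
    passing = [name for name, (cov, acc) in metrics.items()
               if cov >= min_coverage and acc >= min_accuracy]
    return sorted(passing, key=lambda name: metrics[name][1], reverse=True)

# Hypothetical measurements for three clustering methods.
metrics = {
    "k-means": (0.92, 0.81),
    "k-medoids": (0.90, 0.86),
    "dbscan": (0.85, 0.90),   # fails the coverage threshold below
}
chosen = select_clustering_methods(metrics, min_coverage=0.88, min_accuracy=0.80)
```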
2) Optimizing the clustering parameters of the clustering method used to determine the offline positioning data.
When the offline positioning data is determined and the base station data belonging to the same LAC is clustered based on location information, the clustering algorithm involves certain clustering parameters. For example, the k-means algorithm involves the distance between the new cluster centers obtained in each iteration and the cluster centers obtained in the previous iteration; when this distance is less than or equal to a preset distance threshold, the iteration ends. That distance threshold is a clustering parameter. The main parameters of the DBSCAN clustering method are the scan radius eps and the minimum number of contained points minPts. The clustering process is roughly as follows: start from an arbitrary unvisited point and find all neighboring points within distance eps of it. If the number of neighboring points is greater than or equal to minPts, the current point and its neighboring points form a cluster and the starting point is marked as visited; all points in the cluster not yet marked as visited are then processed recursively in the same way, thereby expanding the cluster. If the number of neighboring points is less than minPts, the starting point is temporarily marked as a noise point.
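The DBSCAN procedure outlined above can be sketched as follows. This is a simplified illustration, not the patent's implementation: points are treated as planar coordinates with Euclidean distance (real base station positions would need a geographic distance), the neighborhood of a point is assumed to include the point itself, and the recursion is replaced by an explicit work list. It relies on `math.dist`, available from Python 3.8.

```python
import math

NOISE = -1

def dbscan(points, eps, min_pts):
    """Label each 2-D point with a cluster index (0, 1, ...) or NOISE."""
    labels = [None] * len(points)  # None means "not visited yet"

    def neighbors(i):
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE          # provisional: may later join a cluster as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster    # border point reclaimed from noise
            if labels[j] is not None:
                continue               # already assigned
            labels[j] = cluster
            more = neighbors(j)
            if len(more) >= min_pts:   # j is a core point, so keep expanding from it
                queue.extend(more)
    return labels

# Two tight groups and one isolated point (illustrative coordinates).
pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
labels = dbscan(pts, eps=1.5, min_pts=3)
```

A smaller eps or a larger minPts yields more noise points and tighter clusters, which is why these two values are natural candidates for the tuning described in this section.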
Once a clustering method has been chosen for clustering the base station data belonging to the same LAC to obtain the offline positioning data, the coverage rate and/or success rate of the resulting offline positioning data may be determined, and the clustering parameter values of the clustering method adjusted according to the obtained coverage rate and/or success rate, until the coverage rate and/or success rate of the resulting offline positioning data meets the preset coverage requirement and/or success rate requirement. Clustering is then performed with the adjusted clustering parameter values to obtain the offline positioning data, which is deployed on the mobile device for use in offline positioning.
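The adjust-until-satisfied loop can be sketched as a simple search over candidate parameter values. The patent does not specify how parameters are adjusted, so the ordered candidate list, the `evaluate` callback (standing in for "re-cluster, then measure coverage and success rate"), and the numbers below are all assumptions for illustration.

```python
def tune_parameter(candidates, evaluate, min_coverage, min_success):
    """Try candidate values in order; return the first whose (coverage, success_rate)
    from `evaluate` meets both requirements, or None if none does."""
    for value in candidates:
        coverage, success = evaluate(value)
        if coverage >= min_coverage and success >= min_success:
            return value
    return None

# Hypothetical measurements for three eps values; a real `evaluate` would re-run
# the clustering and recompute both rates against the online positioning log.
measured = {0.5: (0.70, 0.95), 1.0: (0.85, 0.90), 2.0: (0.93, 0.82)}
best_eps = tune_parameter([0.5, 1.0, 2.0], measured.__getitem__,
                          min_coverage=0.80, min_success=0.85)
```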
Furthermore, because applications require different positioning precisions, the clustering method and clustering parameters may be optimized separately for each positioning precision, thereby obtaining offline positioning data for different positioning precisions.
For example, the success rate and coverage rate of the offline positioning data may be calculated for city-level precision, the clustering method selected and the clustering parameters adjusted in the manner described in 1) or 2) above, and city-level offline positioning data obtained with the selected clustering method and the adjusted clustering parameters. The success rate and coverage rate are then calculated for district/county-level precision, the clustering method is selected and the clustering parameters adjusted in the manner described in 1) or 2) above, and district/county-level offline positioning data is obtained with the selected clustering method and the adjusted clustering parameters.
The city-level and district/county-level offline positioning data can then both be deployed on the mobile device, so that when an application on the mobile device requires city-level positioning precision, the city-level offline positioning data is used for offline positioning, and when an application requires district/county-level positioning precision, the district/county-level offline positioning data is used.
Furthermore, the preset requirements on the coverage rate and/or success rate may be adjusted according to the requirement on the data volume of the offline positioning data, so that a clustering method and clustering parameters are selected for which the volume of the resulting offline positioning data meets the requirement. Because the storage capacity of a mobile device is limited, there is a requirement on the data volume of the offline positioning data; if the data volume is too large, the mobile device may be unable to store it, or its performance may be significantly affected. Generally, the higher the coverage rate, the larger the data volume of the offline positioning data, and likewise the higher the success rate, the larger the data volume. Therefore, when selecting the clustering method and adjusting the clustering parameters, the requirements on the coverage rate and/or success rate of the offline positioning database may be adjusted, so that the selected clustering method and adjusted clustering parameters satisfy the coverage and/or success rate requirements of the offline positioning database while also satisfying the data volume requirement of the offline positioning data.
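One way to picture this trade-off is to filter candidate configurations by a record budget and then take the best remaining one. The ranking rule (coverage first, then success rate), the field names, and the figures are assumptions made for this sketch; the patent only states that the requirements may be relaxed so that the data volume fits.

```python
def pick_within_budget(options, max_records):
    """options: dicts with 'name', 'coverage', 'success', and 'records' (data volume).
    Returns the option with the best (coverage, success) that fits the budget, or None."""
    feasible = [o for o in options if o["records"] <= max_records]
    return max(feasible, key=lambda o: (o["coverage"], o["success"]), default=None)

# Hypothetical configurations: higher rates come with more offline records.
options = [
    {"name": "strict", "coverage": 0.95, "success": 0.90, "records": 120_000},
    {"name": "balanced", "coverage": 0.90, "success": 0.85, "records": 60_000},
    {"name": "loose", "coverage": 0.80, "success": 0.80, "records": 30_000},
]
picked = pick_within_budget(options, max_records=80_000)
```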
FIG. 1 is a structural diagram of an apparatus provided by an embodiment of the present invention. The apparatus uses the server-side online positioning log and the offline positioning data to determine the coverage rate and/or success rate of the offline positioning data, which serve as a measure of the quality of the offline positioning data. As shown in FIG. 1, the apparatus includes a coverage determining unit 01 and/or a success rate determining unit 02 (FIG. 1 takes the case where both units are included as an example), and may further include an offline data determining unit 03 and an optimization unit 04.
The coverage determining unit 01 counts the total number of records X within the set time period in the online positioning log, counts the number Y of records within the set time period in the online positioning log whose LAC identification information exists in the offline positioning database, and determines the coverage rate of the offline positioning data from the ratio of Y to X.
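The Y/X computation performed by the coverage determining unit can be sketched as below. The record layout (base station ID, LAC identifier, fix) follows the example given in the method description; the function name and the sample data are hypothetical.

```python
def coverage_rate(online_log, offline_lac_ids):
    """Return Y / X: the share of log records whose LAC identifier
    is present in the offline positioning database."""
    x = len(online_log)
    y = sum(1 for _station_id, lac_id, _fix in online_log if lac_id in offline_lac_ids)
    return y / x if x else 0.0

# Hypothetical log for the set time period; LAC2 is missing from the offline data.
log = [
    ("a", "LAC1", (39.98, 116.30)),
    ("b", "LAC2", (39.90, 116.40)),
    ("c", "LAC1", (39.97, 116.31)),
    ("d", "LAC3", (31.23, 121.47)),
]
cov = coverage_rate(log, {"LAC1", "LAC3"})  # Y = 3, X = 4
```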
The success rate determining unit 02 counts the total number of records X within the set time period in the online positioning log, counts the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data, and determines the success rate of the offline positioning data from the ratio of Z to X.
Specifically, each piece of positioning data in one day's online positioning log may be compared with the offline positioning data. That is, the LAC identification information to which the base station ID in the piece of positioning data belongs is determined first, then the location information corresponding to that LAC identification information in the offline positioning data is determined, and it is judged whether the base station positioning data in the piece of positioning data is consistent with that location information, for example whether it falls within the area corresponding to the LAC identification information. A similar comparison is performed on all the positioning data in the one-day online positioning log, and the number of consistent results is counted; that number is Z.
In addition, when counting the positioning results, the success rate determining unit 02 may further record the positioning precision information of the location information in the offline positioning data that is consistent with each positioning result. The number of counted positioning results having a preset precision is then determined, and the ratio of that number to X gives the success rate of the offline positioning data for the preset precision.
The LAC data in the offline positioning database includes LAC identification information and its corresponding location information. The LAC data may be obtained by the offline data determining unit 03. Specifically, the offline data determining unit 03 merges base station data by LAC to obtain the LAC data, where the base station data includes the base station identifier, the identifier of the LAC to which the base station belongs, and the location information of the base station.
The offline data determining unit 03 may cluster the base station data belonging to the same LAC based on location information to obtain at least one cluster, and record the identification information of the LAC together with the center point location information of each corresponding cluster and its contour information or radius information. Further details of the process of obtaining the LAC data can be found in the description of the method embodiment above.
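Turning one cluster into an offline record (LAC identifier, center point, radius) might look like the following sketch. It assumes the center is the centroid of the cluster's base station positions and the radius is the largest distance from that centroid, measured in the same planar units as the coordinates; the patent does not prescribe either choice, and the names and sample points are hypothetical.

```python
import math

def summarize_cluster(lac_id, points):
    """Return one offline record for a cluster: LAC id, centroid, enclosing radius."""
    n = len(points)
    center = (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)
    radius = max(math.dist(center, p) for p in points)
    return {"lac": lac_id, "center": center, "radius": radius}

# Four illustrative base station positions forming a square.
record = summarize_cluster("LAC1", [(0.0, 0.0), (0.0, 2.0), (2.0, 0.0), (2.0, 2.0)])
```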
After the coverage rate and the success rate of the offline positioning data have been determined, the method for determining the offline positioning data can be adaptively optimized; this optimization is performed by the optimization unit 04. Specifically, the optimization may include optimization of the clustering method and optimization of the clustering parameters.
For optimization of the clustering method, the optimization unit 04 may calculate the coverage rate and/or accuracy rate of the offline positioning data obtained with each of the different clustering methods used to determine the offline positioning data, and select a clustering method whose coverage rate and/or accuracy rate meets the preset coverage requirement and/or accuracy requirement.
For optimization of the clustering parameters, the optimization unit 04 may adjust the clustering parameter values of a given clustering method according to the coverage rate and/or success rate of the offline positioning data obtained with that clustering method, until the coverage rate and/or success rate of the resulting offline positioning data meets the preset coverage requirement and/or success rate requirement.
Furthermore, the optimization unit 04 may adjust the coverage requirement and/or success rate requirement according to the data volume requirement for the offline positioning data, so that a clustering method and/or clustering parameters are selected for which the data volume of the resulting offline positioning data meets the data volume requirement. Because the storage capacity of a mobile device is limited, there is a requirement on the data volume of the offline positioning data; if the data volume is too large, the mobile device may be unable to store it, or its performance may be significantly affected. Generally, the higher the coverage rate, the larger the data volume of the offline positioning data, and likewise the higher the success rate, the larger the data volume. Therefore, when selecting the clustering method and adjusting the clustering parameters, the requirements on the coverage rate and/or success rate of the offline positioning database may be adjusted, so that the selected clustering method and adjusted clustering parameters satisfy the coverage and/or success rate requirements of the offline positioning database while also satisfying the data volume requirement of the offline positioning data.
The apparatus provided by the present invention may be deployed on the server side or as an independent apparatus.
As can be seen from the above description, the method and apparatus provided by the present invention can have the following advantages:
1) The present invention uses the server-side online positioning log and the offline positioning data to determine the coverage rate and/or success rate of the offline positioning data, and uses the coverage rate and/or success rate to reflect the quality of the offline positioning data, thereby providing a quality measure for offline positioning that can guide the optimization of the offline positioning data.
2) The present invention can use the coverage rate and/or success rate of the offline positioning data to optimize the method for determining the offline positioning data. That is, without relying on feedback from mobile devices, a closed loop can be formed on the server side to optimize the data in reverse, improving the quality of offline positioning.
In the several embodiments provided by the present invention, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative; the division into units is only a division by logical function, and other divisions are possible in an actual implementation.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
The integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes a number of instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to perform some of the steps of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The above are only preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (18)

  1. A method for determining the quality of offline positioning data, characterized in that the method comprises:
    determining a coverage rate and/or a success rate of the offline positioning data by using a server-side online positioning log and the offline positioning data;
    wherein determining the coverage rate of the offline positioning data comprises: counting the total number of records X within a set time period in the online positioning log, counting the number Y of records within the set time period in the online positioning log whose LAC identification information exists in the offline positioning database, and determining the coverage rate of the offline positioning data from the ratio of Y to X;
    and determining the success rate of the offline positioning data comprises: counting the total number of records X within the set time period in the online positioning log, counting the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data, and determining the success rate of the offline positioning data from the ratio of Z to X.
  2. The method according to claim 1, characterized in that the LAC data comprises LAC identification information and its corresponding location information.
  3. The method according to claim 2, characterized in that the LAC data is obtained by merging base station data by LAC, the base station data comprising a base station identifier, the identifier of the LAC to which the base station belongs, and the location information of the base station.
  4. The method according to claim 3, characterized in that merging the base station data by LAC comprises:
    clustering the base station data belonging to the same LAC based on location information to obtain at least one cluster;
    recording the identification information of the LAC together with the center point location information of each corresponding cluster and its contour information or radius information.
  5. The method according to claim 2, characterized in that counting the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data comprises:
    comparing whether the positioning result corresponding to a base station ID in the online positioning log is consistent with the location information that corresponds, in the offline positioning data, to the LAC identification information to which the base station ID belongs, and if so, counting that positioning result;
    performing the comparison for all positioning results in the online positioning log within the set time period, the final number of counted positioning results being Z.
  6. The method according to claim 5, characterized in that, when counting the positioning results, the positioning precision information of the location information in the offline positioning data that is consistent with each positioning result is further recorded;
    the method further comprising: determining, among the counted positioning results, the number of positioning results having preset precision information, and determining the success rate of the offline positioning data for the preset precision from the ratio of that number to X.
  7. The method according to any one of claims 1 to 6, characterized in that the method for determining the offline positioning data is optimized by using the determined coverage rate and/or success rate.
  8. The method according to claim 7, characterized in that optimizing the method for determining the offline positioning data by using the determined coverage rate and/or success rate comprises:
    calculating the coverage rate and/or accuracy rate of the offline positioning data obtained with each of the different clustering methods used to determine the offline positioning data, and selecting a clustering method whose coverage rate and/or accuracy rate meets a preset coverage requirement and/or accuracy requirement; or
    adjusting the clustering parameter values of a given clustering method according to the coverage rate and/or success rate of the offline positioning data obtained with that clustering method, until the coverage rate and/or success rate of the resulting offline positioning data meets a preset coverage requirement and/or success rate requirement.
  9. The method according to claim 8, characterized in that the method further comprises:
    adjusting the coverage requirement and/or success rate requirement according to a data volume requirement for the offline positioning data, so as to select a clustering method and/or clustering parameters for which the data volume of the resulting offline positioning data meets the data volume requirement.
  10. An apparatus for determining the quality of offline positioning data, characterized in that it is configured to determine a coverage rate and/or a success rate of the offline positioning data by using a server-side online positioning log and the offline positioning data; the apparatus comprises a coverage determining unit and/or a success rate determining unit;
    the coverage determining unit is configured to count the total number of records X within a set time period in the online positioning log, count the number Y of records within the set time period in the online positioning log whose LAC identification information exists in the offline positioning database, and determine the coverage rate of the offline positioning data from the ratio of Y to X;
    the success rate determining unit is configured to count the total number of records X within the set time period in the online positioning log, count the number Z of positioning results in the online positioning log within the set time period that are consistent with the offline positioning data, and determine the success rate of the offline positioning data from the ratio of Z to X.
  11. The apparatus according to claim 10, characterized in that the LAC data comprises LAC identification information and its corresponding location information.
  12. The apparatus according to claim 11, characterized in that the apparatus further comprises an offline data determining unit configured to merge base station data by LAC to obtain the LAC data, the base station data comprising a base station identifier, the identifier of the LAC to which the base station belongs, and the location information of the base station.
  13. The apparatus according to claim 12, characterized in that the offline data determining unit is specifically configured to cluster the base station data belonging to the same LAC based on location information to obtain at least one cluster, and to record the identification information of the LAC together with the center point location information of each corresponding cluster and its contour information or radius information.
  14. The apparatus according to claim 11, characterized in that, when counting Z, the success rate determining unit specifically performs:
    comparing whether the positioning result corresponding to a base station ID in the online positioning log is consistent with the location information that corresponds, in the offline positioning data, to the LAC identification information to which the base station ID belongs, and if so, counting that positioning result;
    performing the comparison for all positioning results in the online positioning log within the set time period, the final number of counted positioning results being Z.
  15. The apparatus according to claim 14, characterized in that, when counting the positioning results, the success rate determining unit further records the positioning precision information of the location information in the offline positioning data that is consistent with each positioning result, determines, among the counted positioning results, the number of positioning results having preset precision information, and determines the success rate of the offline positioning data for the preset precision from the ratio of that number to X.
  16. The apparatus according to any one of claims 10 to 15, characterized in that the apparatus further comprises an optimization unit configured to optimize the method for determining the offline positioning data by using the coverage rate determined by the coverage determining unit and/or the success rate determined by the success rate determining unit.
  17. The apparatus according to claim 16, characterized in that the optimization unit is specifically configured to calculate the coverage rate and/or accuracy rate of the offline positioning data obtained with each of the different clustering methods used to determine the offline positioning data, and to select a clustering method whose coverage rate and/or accuracy rate meets a preset coverage requirement and/or accuracy requirement; or
    to adjust the clustering parameter values of a given clustering method according to the coverage rate and/or success rate of the offline positioning data obtained with that clustering method, until the coverage rate and/or success rate of the resulting offline positioning data meets a preset coverage requirement and/or success rate requirement.
  18. The apparatus according to claim 17, characterized in that the optimization unit is further configured to adjust the coverage requirement and/or success rate requirement according to a data volume requirement for the offline positioning data, so as to select a clustering method and/or clustering parameters for which the data volume of the resulting offline positioning data meets the data volume requirement.
PCT/CN2016/073271 2015-02-12 2016-02-03 Method and device for determining quality of offline positioning data WO2016127880A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510075882.2A CN105992267B (en) 2015-02-12 2015-02-12 Method and apparatus for determining offline positioning data quality
CN201510075882.2 2015-02-12

Publications (1)

Publication Number Publication Date
WO2016127880A1 true WO2016127880A1 (en) 2016-08-18

Family

ID=56614156

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/073271 WO2016127880A1 (en) 2015-02-12 2016-02-03 Method and device for determining quality of offline positioning data

Country Status (2)

Country Link
CN (1) CN105992267B (en)
WO (1) WO2016127880A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106792710A (en) * 2017-01-16 2017-05-31 北京奇虎科技有限公司 The method and device of pseudo-base station is recognized based on user terminal location
CN107422753B (en) * 2017-07-21 2021-01-01 深圳怡化电脑股份有限公司 Intelligent banknote sorting pressure adjusting system and method
CN107889062B (en) * 2017-11-24 2020-10-30 北京小米移动软件有限公司 Offline positioning data learning method and device
CN108508459B (en) * 2018-04-04 2022-05-10 千寻位置网络有限公司 Online positioning obstacle removing method and device and positioning system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013084032A1 (en) * 2011-12-09 2013-06-13 Nokia Corporation Positioning based on coverage area position information locally stored in a terminal
CN103634902A (en) * 2013-11-06 2014-03-12 上海交通大学 Novel indoor positioning method based on fingerprint cluster
WO2015014544A1 (en) * 2013-07-31 2015-02-05 Here Global B.V. Method and apparatus for determining quality of radio maps

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112564992A (en) * 2019-09-25 2021-03-26 中盈优创资讯科技有限公司 Method and system for judging off-line of access network equipment based on multiple judgments
CN112867145A (en) * 2021-01-08 2021-05-28 腾讯科技(深圳)有限公司 Base station positioning method, device, computer equipment and storage medium
CN112867145B (en) * 2021-01-08 2021-12-07 腾讯科技(深圳)有限公司 Base station positioning method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN105992267B (en) 2019-08-09
CN105992267A (en) 2016-10-05

Similar Documents

Publication Publication Date Title
WO2016127880A1 (en) Method and device for determining quality of offline positioning data
US9026094B2 (en) Method and apparatus for use of performance history data in positioning method selection
US10097420B2 (en) Method and apparatus for determining area in which IP address is located
CN105992338B (en) Positioning method and device
KR101944725B1 (en) Determining timing for determination of applicable geo-fences
EP2620024B1 (en) Generation and use of coverage area models
US11151210B2 (en) Target location search method and apparatus
US20140258281A1 (en) Method And Server For Searching For Nearby User In Social Networking Services
US9220016B2 (en) Position monitoring for a coverage area
CN109936820B (en) User terminal positioning method and device
CN106575294B (en) Track data query method and device
CN105682024A (en) City hot spot identification method based on mobile signaling data
CN105989087B (en) Method and device for determining hot spot area
EP3461105B1 (en) Positioning method and server
CN112352256A (en) Optimizing geographical region selection
CN106899936B (en) Geohash-based gridding position privacy protection method and device
WO2018010693A1 (en) Method and apparatus for identifying information from rogue base station
CN106465175A (en) Method for collecting and aggregating network quality data
CN110298687B (en) Regional attraction assessment method and device
CN104185276B (en) A kind of localization method and device based on wireless network
CN108770057B (en) Method for determining predetermined fingerprint database, method and device for positioning fingerprint, and storage medium
CN106817712B (en) Positioning method and device and server
US8498636B2 (en) State-machine-based operation of a coverage-model-related process
US10582341B2 (en) Facilitating estimation of mobile device presence inside a defined region
CN112887910B (en) Method and device for determining abnormal coverage area and computer readable storage medium

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 16748681; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 Ep: PCT application non-entry in European phase (Ref document number: 16748681; Country of ref document: EP; Kind code of ref document: A1)