我如何处理这个字符串,使我最终得到一个只包含全名&每个实体的相关jpg URL


How can I process this string so I end up with an array that just contains the full name & the associated jpg URL for each entity?

我有以下字符串,我需要将全名加上与之关联的URI/URL提取到每个字符串的数组中。它的构造不像典型的数据,所以我不确定如何继续。一开始我想爆炸,但是一开始就不需要"垃圾"数据,所以我需要一种方法来首先清理数据?但也许有一种有效的方法,在一个或两个步骤/阶段,这样我的代码是有效的?经过一些研究,我正在考虑mb_split(),但我在构造正则表达式方面很差。这里需要一些指导…

在下面的示例中,您可以看到初始"垃圾"之后出现的数据模式。形成的每个分组的第三"行"是我寻找的肉(又名数据)所在的位置。第一组中的第一个字符串,在例子中的第三行是"Amanda Grider"。这是全名及其在返回的数据示例中的位置,我正在从每个分组中寻找。这一行的后面是jpg图像的URI/URL,这是我要查找的第二部分数据。在我看来,其他一切都是垃圾,可以扔掉。

什么是最快,最有效的方式来处理这个块,并得到我寻找的值到一个数组,所以我可以在我的工作中使用它?

(顺便说一句,下面的代码我添加了返回字符,使数据更容易理解。实际上,没有返回字符,它表示为一个大字符串)

[
  [
    "tsg.lac",
    [],
    [
        [
            [null,null,"100829745667958569941"],
            [],
            ["Amanda Grider",null,null,null,"4b3347c83f0a1","8nwbFHob02C8CmojHF","BoZrAHx801Rz8o3h8k",null,"https://lh3.googleusercontent.com/-zIK8ZN_ZDt8/AAAAAAAAAAI/AAAAAAAAAAA/fsiR92bLDlU/photo.jpg",null,1,"Marina del Rey, CA",null,null,null,0,null,[],null,null,null,""],
            []
        ],[
            [null,null,"115014076410206782853"],
            [],
            ["VWvortex",null,null,null,"4b13c6667b3c9","JKCGFo_CApJ","JKCGFo_CApJ",null,"//lh6.googleusercontent.com/-X_wSt8nwpOU/AAAAAAAAAAI/AAAAAAAAACQ/R_jcIPcegbM/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://WWW.VWVORTEX.COM",null,null,3],null,null,"World's largest Volkswagen enthusiast community and blog."],
            []
        ],[
            [null,null,"102608018926739248428"],
            [],
            ["Wale",null,null,null,"4b1ded89a3721","JmRxAk","JmRxAk",null,"//lh4.googleusercontent.com/-xyeyjc4Avow/AAAAAAAAAAI/AAAAAAAAABU/SY-9EKeDnhw/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.ralphfolarin.com/",null,null,6],null,null,""],
            []
        ],[
            [null,null,"114161985228080012446"],
            [],
            ["The Opus Rhythm Music Blog",null,null,null,"4b177a5207d09","IIJj03C4Iog3HIKMIIJz02xEHnRf01ZxFnB","IIJj03C4Iog3HIKMIIJz02xEHnRf01ZxFnB",null,"//lh5.googleusercontent.com/-4QRl1IgDCLU/AAAAAAAAAAI/AAAAAAAAABI/pVoxTQ7SH8Y/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.bacchusentertainment.com",null,null,6],null,null,"We are the team music blog of Bacchus Entertainment"],
            []
        ],[
            [null,null,"114645267718535118440"],
            [],
            ["Jalopnik",null,null,null,"4b12fccb6f809","DHRxFoK0Cng","DHRxFoK0Cng",null,"//lh6.googleusercontent.com/-_M1nn9mKyY8/AAAAAAAAAAI/AAAAAAAAABI/aXhkyN7cuuk/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://jalopnik.com/",null,null,3],null,null,"Jalopnik: Drive Free or Die"],
            []
        ],[
            [null,null,"105503202599719238167"],
            [],
            ["Audi USA",null,null,null,"4b14db7535e99","8owhCkGEHmR","8owhCkGEHmR",null,"//lh3.googleusercontent.com/-mHHyVhWfARE/AAAAAAAAAAI/AAAAAAAAAC4/Qn0lYbilT8M/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.audiusa.com","(800) 822-2834",null,3],null,null,"Progress is social media, and listening, and fans, and Google+. So here we are."],
            []
        ],[
            [null,null,"104108787932235341403"],
            [],
            ["Audi Sport",null,null,null,"4b23243c864b1","8owhCkGAGJC8IF","8owhCkGAGJC8IF",null,"//lh4.googleusercontent.com/-jGBNL9dbwYs/AAAAAAAAAAI/AAAAAAAAAUA/pgsAqvaX8XM/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.facebook.com/AudiSportPage",null,null,6],null,null,"Unofficial Audi Sport fan page, not affiliated with or endorsed by Audi AG."],
            []
        ],[
            [null,null,"106689856342933829975"],
            [],
            ["Volkswagen USA",null,null,null,"4b20ca9b7fa69","JJBxDohI8nBjFFGEHmR","JJBxDohI8nBjFFGEHmR",null,"//lh5.googleusercontent.com/-i3MO9CsymQ8/AAAAAAAAAAI/AAAAAAAAAB4/ddmTW3D8s20/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.vw.com","(800) 822-8987",null,3],null,null,"Take a look around, kick the tires, ask questions and get to know our community."],
            []
        ],[
            [null,null,"115425298803319911308"],
            [],
            ["Internal Frequency",null,null,null,"4b177b6d46119","Co4CAo_08no3BJZjGowjFHhM","Co4CAo_08no3BJZjGowjFHhM",null,"//lh4.googleusercontent.com/-lZeecuGL3Ig/AAAAAAAAAAI/AAAAAAAAABk/Afv5eGuBzUM/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.internalfrequency.com",null,null,6],null,null,"The 1st hand ups-and-downs of the CEO of an up-and-coming entertainment label in Southern California"],
            []
        ],[
            [null,null,"101358795463286919640"],
            [],
            ["Music Think Tank",null,null,null,"4b1947fea8251","EoxACmg3IIJrFIg3IHS0Dk","EoxACmg3IIJrFIg3IHS0Dk",null,"//lh4.googleusercontent.com/-B2KTfl4uNyE/AAAAAAAAAAI/AAAAAAAAACM/N955ZhPV08E/photo.jpg",null,1,null,null,null,null,0,null,[],[null,"http://www.musicthinktank.com",null,null,6],null,null,"Where the music industry speaks out loud. Create the Chaos."],
            []
        ]
    ]
  ]
]

更新:

所以我偶然发现了一些东西,发现数据实际上是有效的JSON,确实被解码并传递回来,但仍然有些奇怪,似乎非常复杂(太复杂了,我需要什么)。我使用json_decode()对数据进行修饰,然后将其赋值给变量$jsondata。当我在后面加上以下内容时:

print_r ( print_r($jsondata));

我得到了这个(我添加了返回字符,所以它更有意义,可以很容易地阅读):

Array ( 
[0] => Array (
    [0] => tsg.lac [1] => Array () [2] => Array (
        [0] => Array (
            [0] => Array (
                [0] => [1] => [2] => 100829745667958569941 ) 
            [1] => Array ( ) 
            [2] => Array (
                [0] => Amanda Grider [1] => [2] => [3] => [4] => 4b33843806e03 [5] => 8nwbFHob02C8CmojHF [6] => BoZrAHx801Rz8o3h8k [7] => [8] => https://lh3.googleusercontent.com/-zIK8ZN_ZDt8/AAAAAAAAAAI/AAAAAAAAAAA/fsiR92bLDlU/photo.jpg [9] => [10] => 1 [11] => Marina del Rey, CA [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => [19] => [20] => [21] => ) 
            [3] => Array ( )
        )
        [1] => Array (
            [0] => Array (
                [0] => [1] => [2] => 115014076410206782853 )
            [1] => Array ( )
            [2] => Array (
                [0] => VWvortex [1] => [2] => [3] => [4] => 4b13c6667b3c9 [5] => JKCGFo_CApJ [6] => JKCGFo_CApJ [7] => [8] => //lh6.googleusercontent.com/-X_wSt8nwpOU/AAAAAAAAAAI/AAAAAAAAACQ/R_jcIPcegbM/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://WWW.VWVORTEX.COM [2] => [3] => [4] => 3 ) [19] => [20] => [21] => World's largest Volkswagen enthusiast community and blog. ) 
            [3] => Array ( )
        )
        [2] => Array (
            [0] => Array (
                [0] => [1] => [2] => 102608018926739248428 )
            [1] => Array ( )
            [2] => Array (
                [0] => Wale [1] => [2] => [3] => [4] => 4b1ded89a3721 [5] => JmRxAk [6] => JmRxAk [7] => [8] => //lh4.googleusercontent.com/-xyeyjc4Avow/AAAAAAAAAAI/AAAAAAAAABU/SY-9EKeDnhw/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.ralphfolarin.com/ [2] => [3] => [4] => 6 ) [19] => [20] => [21] => )
            [3] => Array ( )
        ) 
        [3] => Array (
            [0] => Array (
                [0] => [1] => [2] => 114161985228080012446 )
            [1] => Array ( )
            [2] => Array ( 
                [0] => The Opus Rhythm Music Blog [1] => [2] => [3] => [4] => 4b177a5207d09 [5] => IIJj03C4Iog3HIKMIIJz02xEHnRf01ZxFnB [6] => IIJj03C4Iog3HIKMIIJz02xEHnRf01ZxFnB [7] => [8] => //lh5.googleusercontent.com/-4QRl1IgDCLU/AAAAAAAAAAI/AAAAAAAAABI/pVoxTQ7SH8Y/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.bacchusentertainment.com [2] => [3] => [4] => 6 ) [19] => [20] => [21] => We are the team music blog of Bacchus Entertainment )
            [3] => Array ( )
        ) 
        [4] => Array (
            [0] => Array (
                [0] => [1] => [2] => 114645267718535118440 )
            [1] => Array ( )
            [2] => Array (
                [0] => Jalopnik [1] => [2] => [3] => [4] => 4b12fccb6f809 [5] => DHRxFoK0Cng [6] => DHRxFoK0Cng [7] => [8] => //lh6.googleusercontent.com/-_M1nn9mKyY8/AAAAAAAAAAI/AAAAAAAAABI/aXhkyN7cuuk/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://jalopnik.com/ [2] => [3] => [4] => 3 ) [19] => [20] => [21] => Jalopnik: Drive Free or Die )
            [3] => Array ( )
        ) 
        [5] => Array (
            [0] => Array (
                [0] => [1] => [2] => 105503202599719238167 )
            [1] => Array ( )
            [2] => Array (
                [0] => Audi USA [1] => [2] => [3] => [4] => 4b14db7535e99 [5] => 8owhCkGEHmR [6] => 8owhCkGEHmR [7] => [8] => //lh3.googleusercontent.com/-mHHyVhWfARE/AAAAAAAAAAI/AAAAAAAAAC4/Qn0lYbilT8M/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.audiusa.com [2] => (800) 822-2834 [3] => [4] => 3 ) [19] => [20] => [21] => Progress is social media, and listening, and fans, and Google+. So here we are. )
            [3] => Array ( )
        ) 
        [6] => Array (
            [0] => Array (
                [0] => [1] => [2] => 104108787932235341403 )
            [1] => Array ( )
            [2] => Array (
                [0] => Audi Sport [1] => [2] => [3] => [4] => 4b23243c864b1 [5] => 8owhCkGAGJC8IF [6] => 8owhCkGAGJC8IF [7] => [8] => //lh4.googleusercontent.com/-jGBNL9dbwYs/AAAAAAAAAAI/AAAAAAAAAUA/pgsAqvaX8XM/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.facebook.com/AudiSportPage [2] => [3] => [4] => 6 ) [19] => [20] => [21] => Unofficial Audi Sport fan page, not affiliated with or endorsed by Audi AG. )
            [3] => Array ( )
        ) 
        [7] => Array (
            [0] => Array (
                [0] => [1] => [2] => 106689856342933829975 )
            [1] => Array ( ) 
            [2] => Array (
                [0] => Volkswagen USA [1] => [2] => [3] => [4] => 4b20ca9b7fa69 [5] => JJBxDohI8nBjFFGEHmR [6] => JJBxDohI8nBjFFGEHmR [7] => [8] => //lh5.googleusercontent.com/-i3MO9CsymQ8/AAAAAAAAAAI/AAAAAAAAAB4/ddmTW3D8s20/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.vw.com [2] => (800) 822-8987 [3] => [4] => 3 ) [19] => [20] => [21] => Take a look around, kick the tires, ask questions and get to know our community. )
            [3] => Array ( )
        ) 
        [8] => Array (
            [0] => Array (
                [0] => [1] => [2] => 115425298803319911308 )
            [1] => Array ( ) 
            [2] => Array (
                [0] => Internal Frequency [1] => [2] => [3] => [4] => 4b177b6d46119 [5] => Co4CAo_08no3BJZjGowjFHhM [6] => Co4CAo_08no3BJZjGowjFHhM [7] => [8] => //lh4.googleusercontent.com/-lZeecuGL3Ig/AAAAAAAAAAI/AAAAAAAAABk/Afv5eGuBzUM/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.internalfrequency.com [2] => [3] => [4] => 6 ) [19] => [20] => [21] => The 1st hand ups-and-downs of the CEO of an up-and-coming entertainment label in Southern California )
            [3] => Array ( )
        ) 
        [9] => Array (
            [0] => Array ( 
                [0] => [1] => [2] => 101358795463286919640 )
            [1] => Array ( )
            [2] => Array (
                [0] => Music Think Tank [1] => [2] => [3] => [4] => 4b1947fea8251 [5] => EoxACmg3IIJrFIg3IHS0Dk [6] => EoxACmg3IIJrFIg3IHS0Dk [7] => [8] => //lh4.googleusercontent.com/-B2KTfl4uNyE/AAAAAAAAAAI/AAAAAAAAACM/N955ZhPV08E/photo.jpg [9] => [10] => 1 [11] => [12] => [13] => [14] => [15] => 0 [16] => [17] => Array ( ) [18] => Array ( [0] => [1] => http://www.musicthinktank.com [2] => [3] => [4] => 6 ) [19] => [20] => [21] => Where the music industry speaks out loud. Create the Chaos. )
            [3] => Array ( )
        )
    )
)
) 1

现在有很多数组了!不仅如此,我只需要超级嵌套数组[2],其中名称开始,值[0]&[8] !然后,当下一行(下面)运行时,我得到一个错误,但我不太关心这个,我想知道如何减少这些数据,这样它就不会成为这样的内存消耗....

$visiblepeople = $jsondata[2];

对我来说,这看起来像JSON (http://www.json.org),因此,您可以使用任何JSON实现来解析它。对于Python,有一个模块叫做…